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Abstract 

We report on a new approach to a global CKM matrix analysis taking into account most recent 
experimental and theoretical results. The statistical framework (i?fit) developed in this paper 
advocates frequentist statistics. Other approaches, such as Bayesian statistics or the 95% CL 
scan method are also discussed. We emphasize the distinction of a model testing and a model 
dependent, metrological phase in which the various parameters of the theory are estimated. 
Measurements and theoretical parameters entering the global fit are thoroughly discussed, in 
particular with respect to their theoretical uncertainties. Graphical results for confidence levels 
are drawn in various one and two-dimensional parameter spaces. Numerical results are provided 
for all relevant CKM parameterizations, the CKM elements and theoretical input parameters. 
Predictions for branching ratios of rare K and B meson decays are obtained. A simple, predictive 
SUSY extension of the Standard Model is discussed. 
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1 Introduction 



Within the Standard Model (SM) , CP violation is generated by a single non- vanishing phase in 
the unitary Cabibbo-Kobayashi-Maskawa (CKM) quark mixing matrix V [|^, ^. A useful pa- 
rameterization 1^, ^] follows from the observation that the elements of V exhibit a hierarchy in 
terms of the parameter A = \ Vus\- Other parameters of this representation are A, p and r/, where 
CP violation necessarily requires r/ 7^ 0. The parameters A and A are obtained from measure- 
ments of semileptonic decay rates of K mesons and B meson decays involving b ^ c transitions, 
respectively. Constraints on p and r] are obtained from measurements of semileptonic B decays 
yielding \Vub\ and the ratio |T/ufe/V^fe|- Standard Model predictions of B'^ and B'^ oscillations, 
and of indirect CP violation in the neutral kaon sector, depend on CKM parameters; therefore 
measurements of these observables provide constraints in the p — r] plane, albeit being limited 
by theoretical uncertainties coming mainly from long distance QCD effects. Finally, in the era 
of the B-factories, it will be possible, for the first time, to assess the CP-violating angles q, /3 
and 7 of the Unitarity Triangle (UT) expressing the unitarity relation between the first and the 
third column of V. 

The first goal of a global CKM fit is to probe the validity of the SM, that is to quantify the 
agreement between the SM and the experimental information. Furthermore, one intends to per- 
form a detailed Metrology, that is to find allowed ranges for CKM matrix elements and related 
quantities, assuming the SM to be correct. Finally, within an extended theoretical framework, 
one may search for specific signals of new physics, by estimating the additional theoretical pa- 
rameters. 

Analyzing data in a well defined theoretical framework ceases to be a straightforward task when 
one moves away from Gaussian statistics. This is the case for the theoretically limited precision 
on the SM predictions of the neutral K and B mixing observables and, to a lesser extent, for 
the semileptonic decay rates of B decays to charmed and charmless final states. The statisti- 
cal approach (i?fit) developed in this analysis allows a non-Bayesian treatment of the, a priori 
unknown (i.e., not statistically distributed), theoretical parameters and theoretical systematics 
of measurements. The ensemble of the statistical analysis is realized in the program package 
CkmFittei^ A detailed description of the methods it uses, with emphasis on the new method 
denoted -Rfit which is proposed here, and the presentation of state-of-the-art results are the 
subject of this paperQ. 

The paper is organized as follows: after recalling the most common CKM parameterizations, 
we comprehensively discuss the statistical framework of the analysis, starting with the intro- 



duction of the relevant likelihoods in Section 3.1, followed by a definition of the three analysis 



steps: metrology, model testing and probing for new physics. We then recall the principles of 
the 95% CL Scan scheme [^] and of the Bayesian approach |^, |7| in Section ^ (see also Refs. [§] 
and references therein for a tentative collection of publications on the CKM matrix and related 
topics). We work out their limitations and motivate further going ideas, while never leaving 
non-Bayesian grounds. This is followed by a discussion of the treatment of experimental and 
theoretical systematics in Section |5[ In Section ^ we present a compendium of the input mea- 



^ CkmFitter is a framework package that hosts several statistical approaches to a global CKM fit, such as 
i?fit, Bayesian techniques and the 95% CL Scan method. It is available as public share ware. Please, contact the 
authors for more information. 

^ Visit the CkmFitter web page to find plots, reference links, detailed descriptions and more: 
ht t p : / / www . slac . st anford . edu/ ~ laplace / ckmfitt er .html 
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surements, their predictions in the framework of the SM, and discuss the theoretical parameters 
and their uncertainties used in the analysis. We display our fit results as confidence levels in 
various parameter spaces in Section ^ and produce tables of constraints on all relevant CKM 
parameters, constraining measurements and theoretical inputs, and predictions of rare K and B 
decays. Within our statistical approach, we perform a test of the goodness of the theory and dis- 
cuss the effect of a simple, predictive Minimal Supersymmetric extension of the SM. Deepening 
statistical discussions on some crucial issues of the analysis are given in the appendix. 



2 The CKM Matrix 

Invariance under local gauge transformation prevents the bare masses of the leptons and quarks 
to appear in the SU{3) x SU{2) x U{1) Lagrange density of the SM. Instead, the spontaneous 
breakdown of electroweak symmetry dynamically generates masses for the fermions due to the 
Yukawa coupling of the fermion fields to the Higgs doublet. Since the latter has a non- vanishing 
vacuum expectation value, the Yukawa couplings g give rise to the 3x3 mass matrices 

M. = ^, (1) 

with i = u{d) for up(down)-type quarks and i = e for the massive leptons. The transformation 
of the Mi from the basis of the flavor eigenstates to the basis of the mass eigenstates is realized 
by unitary rotation matrices Ui, where 

^M(d,e)Af„(d,e)f^I(rf,e) = diag (m„(rf_e),"ic{s,M),"it(6,T)) ■ (2) 

For the Lagrange density in the basis of the mass-eigenstates the neutral-current part remains 
unchanged {i.e., there are no flavor changing neutral currents present at tree level), whereas 
the charged current part of the quark sector is modified by the product of the up-type and 
down-type quark mass matrices, 

V = UuUl , (3) 

which is the CKM mixing matrix. By convention, V operates on the —1/3 charged down-type 
quark mass eigenstates 

^ Vud Kts Vub \ 

V= V,d Vcs Vdo (4) 
V Vtd Vts Vth j 

and, being the product of unitary matrices, V itself is unitary: 

VV^ = Id . (5) 

There exists a hierarchy between the elements of V both for their value (the diagonal elements 
dominate) and their errors (since they dominate, they are better known). Unitarity and the 
phase arbitrariness of fields reduce the initially nine complex parameters of V to three real 
numbers and one phase, where the latter accounts for CP violation. It is therefore interesting to 
over-constrain V since deviations from unitarity would reveal the existence of new generation(s) 
or new couplings. 

The charged current couplings among left-handed quark fields are proportional to the elements 
of V . For right-handed quarks there exist no W boson interaction in the SM and the Z, photon 
and gluon couplings are flavor diagonal. For left-handed leptons the analysis proceeds similar 
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to the quarks. 



There are many ways of parameterizing the CKM matrix in terms of four parameters. It is the 
purpose of this section to summarize the most popular representations. 

2.1 The Standard Parameterization 

The Standard Parameterization of V is taken to be the one proposed by Chau and Keung 0], and 



advocated by the PDG [10|. It is obtained by the product of three complex rotation matrices, 
where the rotations are characterized by the Euler angles 9i2, ^13 and 623, which are the mixing 
angles between the generations, and an overall phase 6: 

C12C13 S12C13 si3e~'^ 

V = I -S12C23 - Ci2S23Sl3e*'' C12C23 - Si2S23Sl3e'^ S23C13 | (6) 
S12S23 - Cl2C23Sl3e*'^ -C12S23 - Sl2C23Sl3e"^ C23C13 

where Cij = cosOij, Sij = s'mOij for i < j = 1,2,3. This parameterization has the considerable 
advantage of being exact in the sense that it strictly satisfies the unitarity relation (|5|). 

2.2 The Wolfenstein Parameterization 

Following the observation of a hierarchy between the different matrix elements, Wolfenstein 1^] 
proposed a simple expansion of the CKM matrix in terms of the four parameters A, A, p and 
^ (A = \Vus\ ~ 0.22 being the expansion parameter), which is widely used in contemporary 
literature. Using the convention of Ref. Q one has 

/ i„ iA2- A AX^ip-irj) \ 

V^l -Xll + A^X^p + iT^^^)] l-iA2-iA4(l + 4A2) (7) 

V AA3[l-(p + zr;)(l-iA2)] -AA2 [l + A2(p + ir;-i)] l^^A^X^J 

It is obtained from Eq. via the definitions 

Sl2 = A , 

S23 = AX"" , (8) 
sise-'^ = A\\p-ir,), 

and is valid to the order 0{\\\^) ~ 0.01%. 

2.3 Phase Invar iance 

It was shown by Jarlskog |]l^] that the determinant of the commutator of the up-type and 
down-type unitary mass matrices (Q) reads 

det[M„,Mrf] = -2iF„FrfJ , (9) 

with Fu, Fd, being 

Fu{d) = {mt(b) - iTT'c{s)){mt{b) - mu{d)){mc(s) - . (10) 

The phase-convention independent measure of CP violation, J, is given by 

3 



Im 



yij^klVuVkj = X! ^ikm^jln , (H) 



m,n=l 
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Figure 1: The resettled Unitttrity Triangle in the Wolfenstein pttrameteriztttion. 



with the CKM matrix elements Vij and €ikm being the total antisymmetric tensor. One repre- 
sentation of Eq. (|ll|) reads, for instance, J = ImlVudVcs^us^cd}- ^ non- vanishing CKM phase 
and hence CP violation necessarily requires J 7^ 0. The Jarlskog parameter expressed in the 
Standard Parameterization (^) reads 

J = Cl2C23C?3Sl2S23'Sl3Sin5 , (12) 

and, using the Wolfenstein approximation (^), valid to the order 0(|A|^'^), one finds 

J = A^X^ri - 10^^ . (13) 

The empirical value of J is small compared to its maximum of 1/(6^/3) ~ 0.1 showing that CP 
violation is suppressed as a consequence of the strong hierarchy exhibited by the CKM matrix 
elements. It is the remarkable outcome of Eq. (^) that CP violation requires not only J to be 
non-zero, but also the existence of a non-degenerated mass hierarchy. Equal masses between at 
least two generations of up-type or down-type quarks would necessarily remove the CKM phase. 



Phase convention invariance of the ^/-transformed quark wave functions is a requirement for 
physically meaningful quantities. Such invariants are the moduli \ Vij\'^ and the quadri-products 
^ij^kl^u^kj i^-f-i the Jarlskog invariant J (11)). Non-trivial higher invariants can be refor- 



mulated as functions of moduli and quadri-products (see, e.g., Ref. |jT3[). Indeed, Eq. ( [Tl| ) 
expresses the fact that, owing to the orthogonality of any pair of different rows or columns of V, 
the imaginary parts of all quadri-products are equal up to their sign. We will use phase-invariant 
representations and formulae throughout this paper ^. 

2.4 The Unitarity Triangle 

The allowed region in p — 77 space can be elegantly displayed using the unitarity triangle (UT) 
described by the resettled unitarity relation between the first and the third column of the CKM 
matrix 

VcdV:, v^v;, v^dV:, ■ ^ ' 

Note that twice the area of the non-rescaled UT corresponds to the Jarlskog parameter J. This 
identity provides a geometrical interpretation of the phase convention invariance of J: a rotation 
of the CKM matrix rotates the UT accordingly while leaving its area, and thus J, invariant. It 



^ We are indebted to K. Schubert for drawing our attention to this point. 
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Figure 2: Constraints in the {p, ff) plane for the most relevant ohservahles. The theoretical 
parameters used correspond to some "standard" set chosen to reproduce compatibility. The 
dashed lines indicate the rectangle on which we concentrate in the following for the global fit. 



is the remarkable property of the UT (|14|) that its three sides are governed by the same power 
of A and A 

+ 1 + ^ =^ , (15) 



AA3 



AA3 



which predicts large CP asymmetries in the B sector |12|. As a comparison, the corresponding 
UT for the kaon sector is almost flat 







VudVl 



us ^ KdK*. ^ VtdV^l ^ A A^X^ 

1- + — Y + -'^ "I ^ — 

A A 



VcdV^l V,dV,% v^v*. 



(16) 



exhibiting small CP asymmetries. The UT ( |l4|) is sketched in Fig. Q in the complex (p, ff) plane 
{p = p{l — A^/2), f/ = r/(l — A^/2)) of the Wolfenstein parameterization^. The sides and Rt 
(the third side being normalized to unity) are given by 



Ri, 



p2 _|_ 



(17) 



The length of the vector to the triangle apex is given by 



1 + 



V^aV* 



that the replacements p ^ p(l — A /2) and — » — A /2), where T/ud = 1 — A /2 — 0(A ), improves the precision 
of the apex coordinate in the Wolfenstein approximation 
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Rt 



and the three angles, a, /?, 7, read 



cb 



1 — /o)^ + 



(18) 



arg 



13 = arg 



Vcdv:, 



VtdV;, 



7 = arg 



VcdVX 



(19) 



where 7 = 5 in the Standard Parameterization. The angles and sides of the UT obey the 
trigonometric relation, sina : sin/3 : sin7 = 1 : ■ Rt- 

The relations between the angles and the p, fj coordinates are given by 



sin2a 
sin2/3 
tan7 



2fj{f - p(l - p)) 

{n^ + {I - pYM + p^) ' 

2r?(l - p) 
f?2 + (1 - p)2 ' 

P ' 



(20) 
(21) 
(22) 



A graphical compilation of the most relevant present and future constraints sensitive to the 
CP violating phase 5 is displayed in Fig. |2[ We simplify the representation by assuming a mea- 
surement of sin2a whereas, in principle, the UT angle a can be directly determined from S ^ Svr 
decays. For the third UT angle 7, we assume a measurement of sin7, even though charmless B 
decays may allow a non-ambiguous determination of 7. A more detailed elaboration of future 



measurements is given in Ref. |14]. Some "standard" set of theoretical parameters is used for this 
exercise in order to reproduce compatibility between the constraints. The present experimental 
values for the observables and their dependence on the CKM matrix elements in the framework 
of the SM are discussed in Section |^. 

Over-constraining the unitary CKM matrix aims at validating or not the SM with three gen- 
erations. The interpretation of these constraints requires a robust statistical framework which 
protects against misleading conclusions. The next section describes to some detail the statistical 
approach applied for the analysis reported in this work. 
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3 The Statistical Framework 

We are considering an analysis involving a set of iVoxp measurements collectively denoted by 
a^exp = {2;exp(l), • • • ) 3;exp(-^exp)}) described by a set of corresponding theoretical expressions 
a^theo = {^^theoll); • • • ) ^theo(-^exp)}- The theoretical expressions Xtheo SLie functions of a set of 
-^mod parameters ?/mod = {ymod(l); • • • , ymod(-^mod)}- Their precise definition is irrelevant for 
the present discussion (c./., Section ^ for details) beside the fact that: 

• a subset of A^theo parameters within the Vmod set are fundamental, and free, parameters of 
the theory (i.e., the four CKM unknowns in the SM, the top quark mass, etc.) and are 
denoted ytheo, where ytheo = {ytheo(l), ■ ■ ■ , ytheo(A'theo)}- 

• the remaining A^'qcd = -^mod — -^theo parameters, which appear due to our present inability 
to compute precisely strong interaction quantities {e.g., Jb^, B^, etc.), are denoted yqcD, 
where i/qcd = {yQCD(l), • • • , yQCD(A^QCD)}- 

There are three quite different goals the analysis aims at: 

1. Within the SM, to achieve the best estimate of the ytheo parameters: that is to say to 
perform a careful metrology of the theoretical parameters, for later use. 

2. Within the SM, to set a confidence level (CL) which quantifies the agreement between 
data and the theory, whole. 

3. Within an extended theoretical framework, e.g., Supersymmetry, to search for specific 
signs of new physics by pinning down additional fundamental, and free, parameters of the 
theory. 

These three goals imply three statistical treatments all of which rely on a likelihood function 
meant to gauge the agreement between data and theory. 



3.1 The Likelihood Function 

We adopt a x^-like notation and denote 

X^iVmod) = -21n(£(ymod)) , (23) 
where C, the likelihood function (it is defined below) receives contributions of two types 

•^(ymod) — •^exp(2^exp ^theo (ymod)) Aheo(2/QCD) • (24) 

The first term, the experimental likelihood £exp; 

meaiSures th.6 aigrcGinGnt between x^xp and 
Xtheo) while the second term, the theoretical likelihood -Ctheo expresses our present knowledge 
on the yqcD parameters. 



It has to be recognized from the outset that the notation of Eq. ( p3[ ) is a commodity which 
can be misleading. In general, denoting "Prob" the well known routine from the CERN library, 
one cannot infer a CL from the above value using 

CL = Prob(x'(ymod),iVdof) , (25) 



1 



e-*/2t^dof/2-i _ (26) 



V2^T{Ndoi/2) 

X (Vmod) 

This is because neither £exp nor £theo (they are further discussed in the sections below) are built 
from purely Gaussian measurements: 



9 



• In most cases £exp should handle experimental systematics, and, in some instance, it has 
to account for inconsistent measurements. 

• In practice, £theo relies on hard-to-quantify educated guessworks, akin to the ones used to 
define experimental systematics, but in most cases even less well-defined. 

The first limitation is not specific to the present analysis and is not the main source of concern, 
here. The second limitation is the most challenging: its impact on the analysis is particularly 
strong with the data presently available. The statistical treatment advocated below, denoted 
-Rfit, is designed to cope with both of the above limitations. Notwithstanding its attractive 
features, the Rfit scheme does not offer a treatment of the problem at hand free from any 
assumption: an ill-defined problem cannot be dealt with rigorously. However, the RUt scheme 
extracts the most out of simple and clear-cut a priori assumptions. 

The alternative statistical treatments discussed in Section ^ differ from the -Rfit scheme by the 
procedure used to define the CL from the above x^i or by the content and interpretation of 

•^theo • 



3.1.1 The Experimental Likelihood 

The experimental component of the likelihood is given by the product 



•^cxp(^exp 3;theo(ymod)) — -^expl^) j (^'^) 



where the individual likelihood components account for independent measurements]^. 



The Likelihood Components: Ideally, the individual likelihood components -Cexp(«) 
would be pure Gaussians 



■^exp (^) 



1 



2vr(Texp(^) 



exp 



1 / ^exp(0 •^theo(0 



O-exp(^) 



(28) 



th 



each with a standard deviation given by the experimental statistical uncertainty (Texp(*) of the i 
measurement. However, in practice, one has to deal with additional experimental and theoretical 
systematic uncertainties. 



Experimental Systematics: An experimental systematics is assumed to take the form 
of a possible biasing offset, the measurement could be corrected for, were it be known. Their 
precise treatment is discussed in Section ^. There, a natural extension of the usual method of 
adding linearly or in quadrature statistical and systematic uncertainties is proposed. 



Theoretical Systematics: Theoretical systematics, when they imply small effects, are 
treated as the experimental ones. However, because most theoretical systematics imply large 
effects and affect in a non-linear way the Xtheo prediction, most of them are dealt with through 
the theoretical likelihood component Aheo (c/.. Section 3.1.2|) . 

^ Features marked by *■*' in the following item list are not issued in the analysis presented in this work, but 
may become important in future CKM profiles. 
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Model Dependent Measurements'^*^ : When theoretical systematics cannot be expressed 
as a dependence on not perfectly known parameters, but are expressed as a set of measurements 
based on alternative models, labelled by the index m, {e.g., the exclusive \ Vub/Vcb\ measurement 
exhibits such a model dependence) >Cexp(^) is defined by 

^cxpii) = Ce^pii,m) , (29) 

and m is treated as an additional yqcD parameter, taking only discrete values. 



Identical Observables and Consistency: When several measurements refer to the same 
observable {e.g., various measurements of Am^) they have to be consistent, independently of 
the theoretical framework used for the analysis. 

Similarly, when several measurements refer to different observables which are linked to the same 
ytheo parameter, e.g.., \Vud\ and \Vus\, or determinations of |T4fe| stemming from different ob- 
servables, or measurements of sin2/3 obtained from similar B decays, one may decide to overrule 
possible disagreement by imposing the measurements to be consistent. By doing so, one is de- 
liberately blinding oneself from possible new physics effects which may have revealed themselves 
otherwise. Clearly, such overruling should be applied with great caution, and it should be well 
advertized whenever it occurs. 

The method to deal with this imposed consistency is to account for the measurements at once, 
by merging them into a single component, usually obtained from their weighted mean. A more 
refined treatment is needed when this set of measurements is clearly inconsistent. A general 
method to handle a set of measurements, whether they are consistent or not, is proposed in 
Ref. Yet, for "not too large" inconsistencies, the proposed method yields similar results 

as the rescaling approach adopted by the PDG |10|. To clarify the presentation of the RRt 
scheme, we use the latter: when N inconsistent measurements appear in this analysis, the error 
obtained for their weighted mean is rescaled by the factor VXwm/ (-^ ~ !)> where Xwm is the 
weighted mean x^- 



Related Observables and Consistency*^*^ : In some instances, several observables, al- 
though not identical, are functionally related in a way independent of the theoretical framework 
used for the analysis. The number of such instances is denoted Nest, and the effective number 
of measurements, the one to be used to compute degrees of freedom, is defined by 

N!^p = Nc.p - iVcst . (30) 

An example is provided by the set of measurements yielding separately \ Vub\, \Vcb\ and |14f,/ycfe|. 
The ratio of the first two should be compatible with the third measurement, whether or not the 
SM is valid. Since the measurements are not referring to a unique observable they cannot be 
merged simply into a single component, as above. One should normalize their contribution to 
ensure that they do not contribute to the overall value, if they are in the best possible mutual 
agreement, independently of the theoretical framework used for the analysis. This normalization 
is in fact what is done in the case of identical observables. It is irrelevant for the metrological 
phase of the analysis, and for the third phase, where one searches for specific sign of new physics: 
then, any constant can be added to the without affecting the result. However, it is relevant 
for the second phase, where one probes the SM: a statistical fluctuation in the set \Vub\, 
and |Vu6/T4b| which makes them violate their functional relation should not trigger a claim for 
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new physics. In this example, the normahzation constant is obtained as the maximal value of 



the function of the two variables | V^^^° \ 



I T/theo I 
I ^cb I 



theo I 
ub I 



c 

X £ 



exp 



cxp 



X C, 



exp 




I ^ub 
I T/theo I 

I T/theo I 
_ I ^ub I 
I T/theo I 
I ^cb I 



(31) 



Here as well, care should be taken not to normalize that way the contributions of observables 
the functional connection of which is model dependent. For instance, the measurements leading 
to sin7 and vr — /3 — a are sensitive to new physics because their measured values may violate 
the SM functional relation sin7 = sin(7r — /3 — q) 0. 



Normalization: More generally, the normalization of each individual likelihood compo- 
nent is chosen such that its maximal value is equal to one. This is not important for the analysis, 
but it is convenient: it ensures that a measurement does not contribute numerically to the over- 
all value if it is in the best possible agreement with theory, and that the (so-called) takes 
only positive values. In the pure Gaussian case, it implies simply to drop the normalization 
constant of Eq. (|2^): one is thus recovering the standard definition. 



3.1.2 The Theoretical Likehhood 

The theoretical component of the likelihood is given by the product 

Nqcd 

Aheo(yQCD) = n Aheo(^) , (32) 
i=l 

where the individual likelihood components £theo(^) account for the partial knowledge available 
on the yqcD parameters {e.g., fs^) including more or less accurately known correlations be- 
tween them {e.g., fB^/fBs)- Ideally, one should incorporate in £exp measurements from which 
constraints on yqcD parameters can be derived. By doing so, one could remove altogether the 
theoretical component of the likelihood. However, this is not what is done, because usually 
there is no such measurement: the a priori knowledge on the yqcD stems rather from educated 
guesswork^. As a result, the vCtheo(0 components are incorporated by hand in Eq. (|32|) and 
they can hardly be considered as issued from probability distribution functions (PDF). In effect, 
their mere presence in the discussion is a clear sign that the problem at hand is ill-defined. It 
demonstrates that a (here critical) piece of information is coming neither from experimental, nor 
from statistically limited computations, but from somewhere else: from the mind of physicists. 
At present, these components play a leading role in the analysis and it is mandatory to handle 
them with the greatest caution. 



The Default Scheme - Range Fit {R^t): In the scheme we propose, the theoretical 
likelihoods £theo(0 do not contribute to the of the fit while the corresponding i/qcd param- 
eters take values within ranges, thereafter termed "allowed ranges" and denoted [?/qcd]- The 

® It is wor th p ointing out that an apparent functional violation is present (since long ago) in the available data 
(c/.. Section + |14.p + < 1. 

The same remark applies to experimental systematics, but, since these are usually not the dominant part of 
the experimental uncertainties, the problem is less acute. 
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numerical derivation of these ranges is discussed in Sections ^ and Most of them are identified 
to the ranges [yqcD — fgyst il/QCD + o"syst], where cJsyst is the theoretical systematics evaluated 
for Uqcd- Hence yqcD values are treated on an equal footing, irrespective of how close they are 
from the edges of the allowed range. Instances where even only one of the ?/qcd trespasses its 
range are not considered^. 

This is the unique, simple, and clear-cut assumption made in the i?fit scheme: yqcD parameters 
are bound to remain within predefined allowed ranges. The i?fit scheme departs from a perfect 
frequentist analysis only because the allowed ranges [^/qcd] do not extend to the whole physical 
space where the parameters could a priori take their values^. This should not be understood as 
implying that a uniform PDF is ascribed to each yqcD parameter. This important remark is 
further discussed in Section and in Appendix 



This unique and minimal assumption, is nevertheless a strong assumption: all the results ob- 
tained should be understood as valid only if all the assumed allowed ranges contain the true 
values of their yqcD parameters. But there is no guarantee that this is the case, and this re- 
striction should be kept in mind. On the other hand, also the contrary is true: if the ranges are 
chosen too big, one may miss a discovery. 

3.2 Metrology 

For metrology, one is not interested in the quality of the agreement between data and the theory 
as a whole. Rather, taking for granted that the theory as a whole is correct, one is only interested 
in the quality of the agreement between data and various realizations of the theory, specified 



by distinct sets of ymod values. More precisely, as discussed in Section |3.2.l| , the realizations of 
the theory one considers are under-specified by various subsets of so-called relevant parameters 
values. In the following we denote 



the absolute minimal value of the function of Eq. (23), obtained when letting all N^^d pa- 
rameters free to vary. 

In principle, this absolute minimal value does not correspond to a unique ymod location. This is 
because measurements (resp. theoretical predictions) entering in the analysis are all affected by 
more or less important experimental (resp. theoretical) systematics. These systematics being 
handled by means of allowed ranges, there is always a multi-dimensional degeneracy for any 
value of ■ 

In practice, with the presently available observables, theoretical systematics play a prominent 
role. If one does not incorporate significant sin2/3 measurements in the analysis, the domain 
where = Xmin-j/mod ^® noticeably wide. For convenience, in the following we refer to this 
domain as y^o*^' ^ ^ unique point in the ymod space were leading to Xmin;?/mocr "^^^ projec- 
tions of the domain onto one dimensional spaces result in finite intervals within which the 



In the case of model dependence (c./., Section 3.1.1) the allowed values for the discrete parameter m labelling 
the models correspond to the set of models deemed acceptable. -Rfit is allowed to select at will any one within this 
set, in the same way that it is allowed to select a j/qcd parameter at will within its allowed range. In practice, 
when yqcD parameters cannot be handled beforehand as explained in Section ^ the actuation of the allowed 
range in CkmFitter is obtained using the Set Limit option of the MINUIT package. It is equivalent to set the 
component jCtheo{i) to unity when the corresponding yQco{i) parameter is within [t/QCD(i)], and to zero otherwise. 

^ Not all t/QCD parameters need to be given an a priori allowed range: e.g., values taken by final state strong 
interaction phases (FSI) appearing in B decays are not necessarily theoretically constrained. 
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data analysis cannot make distinction (and similarly for two-dimensional spaces). When sin2/3 
is incorporated in the analysis, one adds a measurement with negligible systematics which lifts 
partially the degeneracy and makes some projections of become point-like. 

This degeneracy should be treated carefully when one is exploring a sub-space a of ymod'- points 
widely apart in a can lead to the same x^, provided the other parameter values are changed 
accordingly. However, except for numerical accidents, identical x^(ymod) = Xminiy^od '^^^^^^ im- 
ply identical £exp components, and hence identical predictions: 2;theo(ymod) values are constant 
within v^Ia- 

Ideally, for metrological purposes, one should attempt to estimate as best as possible the com- 
plete ymod set. In that case, one should use the offset-corrected \^ 

Ax^(ymod) = X^(ymod) - Xmin;j/,„od : (34) 

where x^(ymod) is the for a given set of model parameters ymod- The minimal value of 
Ax^(?/mod) is zero, by construction. This ensures that, to be consistent with the assumption 
that the SM is correct, CLs equal to unity are obtained when exploring the i/mod space (namely, 
once Uraod enters the yj^o*^ domain). In a Gaussian situation, one would then directly obtain the 
CL for a particular set of ymod values as 

^(ymod) = Prob(Ax^(ymod), iVdof) , (35) 
with iVdof = Min(iV|^p, TVmod), where N^^^ is defined in Eq. (||). 

3.2.1 Relevant and Less Relevant Parameters 

However, one is not necessarily interested in all the ymod values, but only in a subset of them. 
This can be for two distinct reasons: 

• The other parameters being deemed less relevant. For instance, in the SM, CP violation 
can be summarized by the value taken by the Jarlskog parameter J, or by the value taken 
by r] (in the Wolfenstein parameterization): the other CKM parameters and the ?/qcd 
parameters may thus conceivably be considered of lower interest. More generally, one is 
rarely considering a CKM fit as a means to pin down anything else than CKM parameters, 
least of all yqcD parameters]^. 

• Parameters that cannot be significantly constrained by the input data of the CKM fit. 
This is the case for most of the non-CKM parameters: yqcD parameters, but also the top 
quark mass, etc. 

In practice, the ymod parameters usually retained as relevant for the discussion are p and 
fj. The other parameters A, A, the top quark mass (etc.) and all the yqcD are considered as 
subsidiary parameters, merely to be taken into account in the analysis, but irrelevant for the 
discussion. In that case, the aim of the metrological stage of the analysis is to set CLs in the 
(/), ff) plane. 

We denote by a the subset of Na parameters under discussion {e.g., a = {p, f/} and /i the A''^ 

^"Although it can be argued that, while theoretical uncertainties dominate, pinning down j/qcd parameters 
might turn out to be the main (and not so interesting) achievement of a CKM analysis... 
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remaining ymod parameters^ 



The goal is to set CLs in the a space, irrespective of the /U values. 

The smaller the region in the a space where the CL is sizeable (above CLcut = 0.05, say) the 
stronger the constraint is. The ultimate (and unattainable) goal being to make this allowed 
region shrink to a point: it would then correspond to the 'true' a. This means that one seeks 
to exclude the largest possible region of the a space. To do so, for a fixed value of a, one has to 
find the fi values which maximize the agreement between data and theory, and set the CL on a 
at the value corresponding to this optimized /i 

CL(a) = Max^{CL(a,/x)} . (36) 

Proceeding that way, one uses the most conservative estimate for a given a point: this point will 
be engulfed in the excluded region only if CL{a, < CLcut, V/u. Stated differently, the CLs one 



is interested in are upper bounds of confidence levels. In effect, as discussed in section |3.2.3 
this is the standard procedure one uses to obtain CLs for a sub-set of fitted parameters. 



3.2.2 Metrology of Relevant Parameters 

According to the above discussion, we denote 

Xmm;/.(a) > (37) 

the minimal value of the function of Eq. (|2^), for a fixed value of a, when letting all 
parameters free to vary. For metrological purposes, one uses the offset-corrected 

Ax^(a) = Xmin; t^ia) - Xmin;j/^od > (38) 

the minimal value of which is zero, by construction: it is reached when a enters the domain. 

Since only the minimal value of the with respect to fj, enters the iJfit analysis, when ^ contains 
a yQCD(i) parameter which appears only in one measurement i, it is advisable to absorb its effect 
by computing beforehand 

exp ( ^ ) max : j/Q CD 0) = MaXj^Q^jj(j){£exp(«)Aheo(2/QCD)} , (39) 

to clarify the analysis[^ (see Section ^). 



Gaussian Case: In a purely Gaussian situation one would directly obtain the CL for a as 

P(a) =Prob(Ax'(a),Afdof) , (40) 

where N^oi = yim{N^^p — N^, Na). Equivalently, one may derive the same CL from the covari- 
ance matrix obtained from the fit leading to the absolute minimum, if in the a-region under 
consideration, the is parabolic. 

It is worth to stress that this splitting is arbitrary and that it can be changed at wiU: for instance one may 
decide to focus only on a = {J}, or to consider a = {sin2a, sin2/3}, etc. 

This should be done for instance for FSI phases, most notably for the determination of the angles a and 7. 
But this cannot be done for the product fB^VBd, because it appears in Amd but also indirectly in Ams, since 
fs^VBd and fs^^/B^ are theoretically linked. 
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Non-Gaussian Case: In a non-Gaussian situation, one has to consider Ax^(a) as a test 
statistics, and one has to rely on a Monte Carlo simulation to obtain its expected distribution 



in order to compute V{a). As further discussed in Section 3.3, this does not imply taking a 



Bayesian approach and to make use of PDFs for the unknown theoretical parameters /i. 



For the sake of simplicity, we use Eq. ( [40D in the present work. This implies that the experimental 
component vCexp(a;exp — 3;theo(ymod)) is free from non Gaussian contributions and inconsistent 
measurements. However, the Ax^(a) function itself does not have to be parabolic. What matters 
is that the >Cexp components are derived from Gaussian measurements (c./.. Section 3.2.3| for an 
example), being understood that no £theo components are present. 

3.2.3 Illustrations 

To illustrate the above definitions we consider two specific examples in this sectiorf^ 

Standard Situation: Consider an analysis which depends on only two quantities: the 
first is a fundamental parameter a, and the second is a QCD parameter i/qcd- We assume here 
that the situation is a standard one, where it turns out that both quantities are simultaneously 
measurable: the full X^ia^yoiC'D) function takes the form 



-2c 



' a-a^ \ ( j/QCD -?/qcd \ 2 

. ^[«] i I ^[yqCD] )+^--^»'^QCB 



where c is a correlation coefficient. Applying the i?fit scheme, the 95% CL interval for a is 
obtained as follows. One first computes the offset-corrected 

Xmin; j/qcd («)- Xmin; a, j/qcd ' 

(42) 



a — a 



0\2 



1 - c") . (43) 



The limits a± of the 95% CL interval are such that 

Prob(Ax'(a±), 1) = 0.05 ^ Ax\a±) = 3.84 , (44) 

and hence 

a± = aO ± 1.96 -^M= , (45) 



c 



which is just the standard answer for the 95% CL interval of a, if one disregards information on 

yqcD- 

Measurement of sin2/3: If one uses a = {p, fj}, a measurement of sin2/3 alone yields a 
double infinite degeneracy corresponding to the solutions of Eq. ([2l|) , namely 



l±Jl- (sin2/3e: ^2 



A more involved example is discussed in Appendix |c| 
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Along the two above straight lines in the {p, rj) plane, x^io-) = Xmm;y^ad ~ ^' There is no /x 
parameters here, and hence 

Ay^(a) = Y^- fal-Y^. = /^ si^2/3exp-sin2Aheo y , . 

I"; Xram■,^l\^) Amm;y„,od \^ cr[sin2/?] / ' ^ ' 

Using Eq. ( ^0|) one gets the CL in the (p, fj) plane 

P(p,7?) = Prob(Ax'(a),l) . (48) 
While the double infinite degeneracy of Ax^(a) = clearly precludes a parabolic behavior for 



this function, Eq. (48) remains exact due to the right hand side of Eq. (47) 



3.3 Probing the SM 

By construction, the metrological phase is unable to detect a failure of the SM to describe the 
data. This is because Eq. (]38|) wipes out the information contained in /tmin-ymod ' "^^^^ value is a 
measure (a test statistics) of the best possible agreement between data and theory. Ideally, in a 
pure Gaussian case, this quantity could be turned into a CL referring to the SM as a whole in 
a straightforward way 

V{SM) < Prob(xL„;,,.„, , A^dof ) , (49) 

with A'^dof = -Al^fp — Amod, were it be a positive number. The whole Standard Model being at 
stake one should not rely on Eq. (|49D, but use a Monte Carlo simulation to obtain the expected 
distribution of Xmin j/mod ' ^^^^ Monte Carlo simulation is built as follows]^: 

• One selects a set of ymod values within y^^^ and assumes it to be the true one|^. 

• Then one generates all Xexp(^), following the distribution of individual experimental likeli- 
hood component Cexp{i)-, having reset their central values to the values Xexp(^) = a;theo(0 
computed with the above ymod set. In case of significant experimental systematics, this 
implies the use of appropriate PDFs as discussed in Section 

• In contrast to the above, one does not modify the £thco component of the likelihood: their 
central values are kept to their original settings. This is because these central values are 
not random numbers, but parameters contributing to the definition of C. 

• Then one computes the minimum of the by letting all j/mod ^se to vary, as is done in 
the actual data analysis. 



• Prom this sample of Monte Carlo simulations, one builds J-{'x^)^ the distribution of 
Y^:„.„ , normalized to unity. 

The CL referring to the SM as a whole is then 



P(SM) < j HX^) dx^ , (50) 
which is the upper bound to the CL one may set on the SM. 
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Fo r the sake of generality, the theoretical likelihood is not assumed to be necessarily the trivial _Rfit one (c./., 



3.1.2) 



Section 

As discussed above, the various optimal realizations yield identical theoretical predictions, the choice made 
for a particular j/mod within y'^l^ is thus irrelevant. It was explicitly checked that the outcome of the analysis 
does not depend on this choice. 
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3.4 Probing New Physics 

If the above analysis establishes that the SM cannot accommodate the data, the next step is to 
probe for the new physics revealed by the observed discrepancy. The goal is akin to metrology: 
it is to measure new physical parameters ^np (whose values, for example, are zero if the SM 
holds) complementing the set of ytheo parameters of the SM. The treatment is identical to the 
one of Section using a = {?/np}- The outcome of the analysis is for example a 95% CL 
domain of allowed values for yNP defined, in a first approximation, from Eq. ( |40| ) 

P(yNp) = Prob(Ax'(yNp), N^p) > 0.05 . (51) 

Even if the SM cannot be said to be in significant disagreement with data, it remains worthwhile 
to perform this metrology of new physics, for two reasons: 

• It might be able to faster detect first signs of discrepancy between data and the SM, if 
the theoretical extension used in the analysis turns out to be the right one. The two 
approaches are complementary, the first (c./.. Section ^^) leads to a general statement 
about the validity of the SM, independently of any assumption for the new physics, the 
second is specific to a particular extension of the SM. In that sense it is less satisfactory. 
Being complementary, the two approaches can disagree: the first may conclude that the 
SM is in acceptable agreement with data, while the second may exclude the SM value 
Vnp = 0, and, conversely, the first may invalidate the SM, while the second may lead to a 
fairly good value of 'P(yNP = 0), if the extension of the SM under consideration is not on 
the right track. 

• The most sensitive observables, and the precision to be aimed at for their determination 
cannot be derived by any other means than by this type of analysis. When considering 
new experiments, it is therefore particularly valuable to have a sensitive model of new 
physics, to prioritize the efforts and set the precisions to be achieved. 

4 Alternative Statistical Treatments 

Several alternative statistical treatments are available. Three of them are briefly discussed 
below: however not all variations are considered. The relative merits and limitations of the 
three treatments will not be discussed extensively here, except to point out features of the i?flt 
scheme. 

4.1 Reminder: The RRt Scheme 

Let us briefly re-sketch the main steps of an analysis in the -Rflt scheme: for a given point a in 
the parameter space {e.g., a = {p, f]}) i?fit proceeds to: 

• Find the overall minimal Xmin y,nod '^^^^ respect to all theoretical parameters. 

• Perform a discrete, although fine scan of the a space, and minimize Xmm- ^^(^) with respect 
to the remaining parameters /i, for each point: the yqcD parameters being allowed to vary 
freely within their [?/qcd] ranges. 

• Calculate the offset-corrected CL {V{a) of Eq. (|40|)), for each point. It is the upper bound 
of the confidence levels one may set on a, which corresponds to the best possible set of 
theoretical parameters //. 
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• Compute the CL of the overall Xmin-y^^^ by means of a Monte Carlo Simulation. It is the 
upper bound of the confidence levels one may set on the SM, which corresponds to the 
best possible set of ?/qcd parameters. 

The Rht scheme suffers from two drawbacks: 

• It relies on a priori allowed ranges for the yqcD parameters. 

• In the hopeful case where data are such that the method is lead to "rule out" the SM, 
it provides no indication as to which ygcD parameter(s) should be allowed to trespass its 
allowed range, and by how much, to rescue the SM. 

4.2 The 95% Scan Method 

The 95% CL Scan method Q does not incorporate the theoretical component Aheo except to 
define allowed ranges for the yqcD values: in effect, this is equivalent to the i?fit scheme. These 
yqcD values are equidistantly scanned within their allowed ranges. For each set of yqcD values 
(denoted model in the 95% CL Scan method terminology) three operations are performed: 

1. One determines 

(yqCD) , (52) 

the minimal value of the function of Eq. ( |2^ ) , from £exp only, for a fixed set of yqcD 
values, when letting all ytheo parameters free to vary. One then computes the confidence 
level 

ViVQCB) = Prob(Xmm;j/thco(yQCD),^^dof) , (53) 

where A^dof = ^exp ~ ^theo- If 'P(yQCD) is above a threshold CLcut (usually CLcut = 0.05) 
the model is considered as acceptable, and selected. The SM is ruled out if no model is 
selected. 

2. Among the ytheo values, a subset a is retained as central values to be displayed for the 
current model (if selected) and CLs in the a space are derived using 

^X^(a,yQCD) = Xmm;j/thco7^(i("'2^QCD) (yqco) , (54) 

and 

V{a, yqcD) = Prob(Ax^(a, yqco), iVdof) , (55) 
where iVdof = Min(iV^fp - iVthco + Na, N^). 

3. The method concludes by a graphical display, for all selected models, of the contours in 
the a space defined by 'P(a,yQCD) = CLcont (with CLcont = 0.05). 

4.2.1 Comparison with i?fit 

Although the outcome of the 95% CL Scan method, the graphical display, is quite different from 
the -Rfit one, both schemes are close in nature: they are frequentist approaches, flawed by the 
same double drawback mentioned in the previous section. In addition, while it is built on rather 
flrm ground, the 95% CL Scan method presents several unwelcome features which are reviewed 
in Appendix The main differences between the two methods are: 
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• whereas R&t seeks for a statistical statement pertaining to a, and only to a, the 95% CL 
Scan method leads to statements on a, for given values of n which take the form of the 
95% CL contours. 

• to correct for this, the 95% CL Scan method may conclude by an envelope, which delimits 
an allowed region in the a space with at least 95% CL. 

• -Rfit draws a single smooth CL surface. From this surface one can read off the 95% CL 
contour, or define a family of contours, each corresponding to a given CL. These contours 
encircle a domain, the plateau of the CL surface, where the CL is essentially equal to unity. 

• R&t is flexible. The default treatment can be extended to atone for the second fundamental 
drawbacks, and to accommodate for a smooth transition toward the Bayesian treatment, 
while, nevertheless, keeping part of the virtues of the 95% CL Scan method and of the 
i^fit scheme. This is discussed in the next section. 

4.3 The Extended Conservative Method (Emt) 

The -Rfit scheme uses >Ctheo(0 functions which trivially take only two values: either 1 within the 
allowed range, or outside, thereby strictly forbidding any yqcD to trespass [yqco]- Instead, 
the extended ERfit scheme uses for £theo(0 functions which take values between 1 and 0. They 
are equal to 1 within [yqco] (there, they do not contribute at all to the full x^, and one recovers 
the -Rfit scheme) and drop smoothly to outside. These functions are not PDFs: they are not 
combined the ones with the others through convolutions, and hence (see next Section) the ERfit 
scheme is not a Bayesian scheme. 

The precise way the functions decrease down to zero is obviously arbitrary: one needs to define 
a standard. The proposed expressions for £theo(^) are presented in Section ^. Their relevant 
characteristic is the following: they use two continuously varying parameters, denoted ^ and 
K. The first parameter is a scale factor which fixes the allowed range where £theo(0 — 1- The 
second parameter determines the transition to zero. The parameter values permit to cover a 
large spectrum of schemes, ranging from i?fit (C = 1, k = 0), to a Gaussian scheme (C = 0, 
K = 1) and defining a standard^ denoted ERfit, for which C = 1? and k ~ 0.8. 

Because ER&t acknowledges the fact that the allowed ranges should not be taken literally, it 
offers two advantages over Rfit: 

• ERfit is more conservative than Rfit: by construction, a ERfit CL is always larger than the 
corresponding Rfit one, and in the {p, ff) plane its CL surface exhibits the same plateau 
at CL = 1 (i.e., the -Rfit and ERfit y^^j spaces are identical). 

• In case the SM tends to be ruled out by i?fit, the ERfit scheme is able to detect the 
eventual yqcD parameter(s) which, if allowed to trespass its allowed range, would restore 
an acceptable agreement between data and theory, and which value it should take. 

Despite the two above arguments in favor of ERfit, Rfit is chosen as the scheme advocated in 
this paper rather than ERfit: because it uses a simpler and unique presciption to incorporate 
theoretical systematics, it is less prone to be confused with a Bayesian treatment. Moreover, 
ERfit does not provide a clear-cut distinction between statistical and theoretical systematic 

Obviously, it would be better if theorists, and not experimentalists, choose for these two parameters the 
values which appear the most adequate for each of their predictions. 
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errors in the fit. Finally, in cases where one determines theoretical parameters via the fit, as 
it is the case, e.g., for the quantity /b^V^B^, R^t is the natural choice. But, obviously, if i?fit 
concludes to a SM failure, then ERfit should be used. 



4.4 The Bayesian Treatment 

The Bayesian treatment [^, ^ considers £ as a PDF, from which is defined J- {a), the PDF of a, 
through the convolution 

T{a) = C J £(ymod) S{a - a(ymod)) dymod , (56) 

where the constant C is computed a posteriori to ensure the normalization to unity of T{a). In 
practice, the integral can be obtained very conveniently by Monte Carlo techniques. For each 
point in the a space one sets a confidence level CL(a), for example, according to 

CL(o) = j r{a) da , (57) 

r{a')<r{a) 

but another definition for the domain of integration can de chosen. The method concludes by a 
graphical display of CL. In particular, the 95% CL contour can be read-off among others. New 
physics is not meant to be detected by the Bayesian treatment: it is aimed at metrology mostly. 



4.4.1 Comparison with Rfvt 

Although their graphical display appear similar, the Bayesian treatment and the -Rfit scheme 
depart significantly: the meaning attached to a given CL value are not the same. For the 
Bayesian treatment, the CL is a quantity defined, using Eq. (^), for example by Eq. (^7|). The 
justification of this definition lies in the understanding that a CL value is meant to provide a 
quantitative measure of our qualitative degree of belief. Whereas one understands qualitatively 
well what is meant by degree of belief, because of its lack of formal definition, one cannot check 
that it is indeed well measured by the CL: the argument is thus circular. One is left with the 
sheer definition of Eq. (57), which, being just a definition, suffers no discussion. 



The key point in the Bayesian treatment is the use of Eq. ([5q), even though the likelihood 
contains theoretical components. This implies that the yqcD parameters, which stem from 
theorist computations, are to be considered as random realizations (sic) of their true values. 
The PDFs of these 'random' numbers are then drawned from guess-work (The [yqco] ranges do 
not fare better with respect to that.). For self-consistency, if one assumes that a large number 
of theorists perform the same yqcD computation, the distribution of their results should then 
be interpreted as a determination of the yqcD PDF. Once injected in Eq. (|5^), this PDF, the 
shape of which contains no information on nature, but information on the way physicist mind 
work, will be transformed into information pertaining to nature. This entails to a surprising 
confusion between what is an experimental result and what is a thinking result. As illustrated 
in Appendix ^ and in Section 7.2.1 , it is less the ad hoc shapes of the PDFs which are at stake 



than the implication of using Eq. (d 



5 Likelihoods and Systematic Errors 

In Section ^ we have defined the basic formalism of the i?fit scheme. The treatment of experi- 
mental and theoretical systematics is the subject of this section. 
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Let xq be a quantity, which is not a random variable, but which is not perfectly known. We will 
consider in turn two quantities of this type: 

• A theoretical parameter which is not well determined {e.g., xq = fsd)- the theoretical 
prediction of an observable depends on xq (e.g., AMb^)- 

• An experimental bias due to detector/analysis defects: the measurement should be cor- 
rected for this bias. 

It is the purpose of this section to suggest a prescription of how to incorporate the limited 
knowledge of such quantities into the analysis. The standard treatment of this problem relies 
on a analysis 

2 / ^exp ~ 3^theo \ , f XQ — Xo\'^ /ro\ 



^exp 



where 



• a^theo (resp. Xexp) depends on xq, if the latter is a theoretical parameter (resp. experimental 
systematics), 

• Xq is the expected central value of xq, 

• (To is the uncertainty on xq. 

This standard treatment is satisfactory as long as the degree of belief we put on the knowledge 
of the value of xq is peaked at xq and distributed like a Gaussian. This is usually summarized 
by 

xo = xo± ao ■ (59) 

However, this is not necessarily what is intended to be meant by Eq. (^9|). Rather, the theorist 
(resp. the experimentalist) may mean that the prediction (resp. the measurement) can take any 
value obtained by varying xq at will within the range [xq — C^^o, xq + Cf^o] (denoted the allowed 
range below, where C is a constant scale factor of order unity), but that it is unlikely that xq 
takes its true value outside the allowed range. This does not imply that the possible values are 
equally distributed within the allowed range: they are not distributed at a/Q If Eq. (^) is 
given such a meaning, then the statistical analysis should treat all xq values within the allowed 
range on the same footing (which again does not imply with equal probability): this corresponds 
to the Rat scheme (with C = 1). 



This is not the case for the expression of Eq. (58) since the farther xq moves away from xq, 
the larger becomes the related component of x^- 

On the other hand, it might also be useful to define specific tails instead of sharp cuts, thus 
allowing the theoretical parameters to leave their allowed ranges, if needed: this corresponds to 
the ER&t scheme. 

The idea is to move from a pure x"^ analysis to a log-likelihood one, redefining the x"^ to be 

X' = f - 21n£3yst(xo) , (60) 

where vCsyst(a;o)) hereafter termed the Hat function, is a function equal to unity for xq within 
the allowed range. Its precise definition is given below. 

In some cases {e.g., lattice QCD) statistical fluctuations may enter in the computation. In such instances 
one may reliably define a Gaussian likelihood for this component of the theoretical uncertainty. 
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Figure 3: The k parameter as a function of Q (see text). 
5.1 The Hat Function 

The Hat function £syst(2;0) i^-, C) is a continuous function defined as 

( , Vxo e [xq ± C(Jo] 

- 21„£,..(.o,.,C) = I ^ [^^^ ^ (01) 

where the constant k determines the behavior of the function outside the allowed range. For the 
iifit scheme k = is used. To define a standard n can be chosen to be a function of Q such that 
the relative normalizations of /Zgystl^^O) '^j C) (briefly viewed here, for the purpose of defining a 
standard, as a PDF) within and outside the allowed range equal those of a Gaussian of width 

J Csyst{xo,K,C) dxQ- J e"*^ dt = \/nCao ■ (62) 

-oo 

The parameter k is numerically computed as a function of C- The result is shown in Fig. ^, in 
the range of interest < < 3. For the limit C ^ one obtains k ^ 1, and the Hat becomes a 
pure Gaussian. The ERfit scheme is defined by ^ = 1, for which one obtains k ~ 0.8. 

Examples of Hat functions with xq = and do = 1 are shown on the left plot of Fig. ^ Being a 
function, and not a PDF, vCsyst(3^o) needs not be normalized to unity. 



5.2 Combining Statistical and Systematic Uncertainties 

Having defined £syst (2^0)1 and following Eq. (^9|), one proceeds with the minimization of the 
of Eq. (|^) by allowing xq to vary freely. 

For theoretical systematics, the result depends on the way xq enters Xtheo) and not much more 
can be said in generality. 
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Figure 4: The left hand plot shows the Hat functions (xq = and (Tq = 1) used for the -Rfit 
scheme, the ERfit scheme and the Gaussian treatment. The right hand plot shows the combined 
likelihood exp>Csyst (with xq = and iToxp = = obtained from Eq. for the i?fit scheme, 
the ERm scheme, a convolution of a Gaussian with a uniform distribution (hence taken as a 
PDF, following the Bayesian approach) and a convolution of two Gaussians (see Appendix^. 

For experimental and theoretical systematics where xq can be assumed to be an unknown offset]^ 
the quantity to be confronted to the theoretical prediction Xtheo is simply Xexp + xq. Omitting 
the details of straightforward calculations, and cissuming that xq — (otherwise Xexp 

should be 

corrected for it), one obtains, after minimization of the with respect to xq: 



^exp 



2 

min; xq 



. 



CcTo <| Xexp - Xtheo l< Ml + {^ f) : xLn;.o = (^^^ 



hco| — C°"o 



L-exp 



2;theo 1^ <,0-oU + [ — ) ) . Xmm;xo " -^^M^^ 



cxp 



{if- 



In the limit C ~^ (and hence, k — > 1) only the third instance is met, and one recovers the usual 
rule of adding in quadrature the statistical and the systematic uncertainties. Otherwise, the 
result is non trivial. An example of the effective likelihood cxp-^syst(^exp — ^theo) — — ^Xmin-a^o 
(with Xq = and Uexp = o"o = 1) is shown in the right hand plot of Fig. |^ for the i?fit scheme, 
the ERfit scheme, a convolution of a Gaussian with a uniform distribution (hence taken as a 
PDF, following the Bayesian approach) and a convolution of two Gaussians. 



If systematics take the form of an unknown multiplicative factor, and this is often the case for theoretical 
uncertainties, a treatment similar to the one discussed here applies. 
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6 Fit Ingredients 



This section provides a compendium of the measurements and SM predictions entering the overah 
constrained CKM fit. In some cases, we pre-combine compatible measurements by means of a 
simple weighted mean in order to speed up the fit by reducing the effective number of degrees of 
freedom (see the discussion in Section |3.1.1j ) . Since the solution of a x'^ minimization of Gaussian 
distributed measurements corresponds to the weighted mean, this entails no loss of information. 
Below, we give a status of the input quantities used. The corresponding numerical values that 
enter the fit and the treatment of their uncertainties within R&t are summarized in Table 0. 



6.1 The CKM Matrix Elements 

• l^dl '• The matrix element \Vud\ has been extracted by means of three different methods: 
super-allowed nuclear /3-decays, neutron /3-decay and pion /3-decay. 

Using the lifetime measurements of super-allowed nuclear /3-decays with pure Fermi- 
transitions (O"*" — > 0^), \Vud\ can be extracted with high precision. Averaging the val- 



ues of nine different super-allowed nuclear /3-decays Jl^ the result including nucleus 



independent and nucleus-dependent radiative corrections as well as charge-dependent cor- 



rections (see Ref. |18| for a compendium of references) is: \Vud\ = 0.97400 ± 0.00014exp ± 
0.00048thco [18) 17 1- The precision of \Vud\ from nuclear /3-decays is often questioned in 
light of the observed '2(7' deviation from the unitarity condition when combining this value 
with the best knowledge for \Vus\ and \Vub\- A possible enhancement of \Vud\ is predicted 
by a quark-meson coupling model due to a change of charge symmetry violation for quarks 
inside bound nucleons compared to unbound nucleons |l^. Since the status here is un- 
clear, the error has been enlarged by the amount of the possible correction using the PDG 
rescaling @: = 0.9740 ± 0.0010.[3 



For the neutron /3-decay no nuclear structure effects play any role. However, \Vud\ has to 
be extracted from two quantities, the neutron lifetime and the ratio QA/dv- In contrast 
to nuclear /3-decays these measurements are not dominated by theoretical uncertainties. 
The weighted mean for the neutron lifetime measurements is r„ = (885.7 it 1.0) s [pO] and 



the average value for gA/dv reads —1.2699 it 0.0029 [21] where the error was rescaled by 
a factor of two due to inconsistencies in the data set. Combining these numbers we get: 
\Vud\ = 0.9738 ± 0.0020g^/g^ T-,^ ± 0.0004].ad) where the error is dominated by the experi- 
mental error on QA/gv- 

The pion /3-decay vr"^ — > ir^e^Ve is a very attractive candidate to extract \Vud\ from the 
branching ratio and the pion lifetime, since it is mediated by a pure vector transition and 
does not suffer from nuclear structure effects. However, due to the small branching ratio, 
BR = (1.025 ± 0.034) x 10"^ |0|, the present statistical accuracy is not competitive with 



the other methods: = 0.967 ± 0.016br ± 0.0009theo- 

Assuming all three measurements to be Gaussian distributed^ we combine them via a 
weighted mean, yielding: = 0.97394 ± 0.00089. 



It should be kept in mind that such a procedure might hide a possible violation of the unitarity condition in 
the first family. 

Being consequent, also in this case, one should treat the theoretical errors as ranges. However, as long as the 
relative uncertainty on A from |Kid| and \Vus\ is much smaller than what one obtains for (p, ff) from constraints 
like |Ki6/v'^[,l, Iea'I, Amd and Ams/Amd, the procedure used is certainly not a critical issue. 
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: The analyses of kaon and hyperon semileptonic decays provide the best determi- 
nation of \Vus\ directly related to A in the Wolfenstein parameterization. However, due to 
theoretical uncertainties from the breakdown of SU(3) flavor symmetry, the hyperon decay 
data are less reliable [^, As a consequence, we use only the value obtained from the 
vector transitions and Kl — > vr e'^i'e Owing to the small electron mass. 



only one form factor plays a role in K^^ decays the functional dependence of which can be 
extracted from data. The form factor value at zero recoil, /i(0), is calculated within the 
framework of chiral perturbation theory and is found to be: //^ ^ (0) = 0.961 it 0.008 



The error estimate for this value was criticized in Ref. |25]. However, a relativistic con- 
stituent quark model, successful in the description of electroweak properties of light mesons 
gives a consistent result: ff^ (0) = 0.963ib0.004 |26]. Channel-independent and channel- 



dependent radiative corrections ||27| , psj as well as charge symmetry {K~^ /Kl) and charge 



independence (vr /vr ) breaking corrections |24] are applied in order to compare the results 
from both channels: /f (0)|Ks| = 0.2134 ± 0.0015exp ± O.OOOl^ad {K-^ ^ vr°e+z^e) and 
/f "'^ (0)|K.| = 0.2101 ±0.0013exp iO.OOOlrad {Kl TT'e+Ue). The weighted average is: 
/f ''"(O)IK.I = 0.2114±0.0016 where the error was rescaled by a factor of 1.6 to account 
for inconsistencies. 

The result then reads |Ks| = 0.2200 ± O.OOlTgxp ± O.OOlStheo = 0.2200 ± 0.0025. As in the 
case for Vud all uncertainties were considered as Gaussian errors. 



• \Vub\ '• Both, inclusive B-decays (6 X„£~z/^), measured at LEP p9, 

sive B-decays {B^ I'^v^, p^i^ui), measured by CLEO |^2|, allow an extraction 

of the third column element [T^blp^ 

The exclusive measurements are dominated by the theoretical uncertainty due to the 
model dependence in the determination of the form factor. The exclusive CLEO mea- 
surements give: \Vub\ = (3.25 ± 0.14statio 29sys ^ 0-55theo) X 10~^, where the error is 
dominated by the theoretical uncertainties. We add the statistical and experimental 
systematic error in quadrature and consider the theoretical error as a range: \Vub\ = 
(3.25 ± 0.29exp ± 0.55thco) x 10"^. There is some hope that exclusive measurements in the 
future may take advantage of unquenched lattice QCD calculations and thus reduce the 
model dependent error. 

The three inclusive LEP measurements rely on different techniques and are combined in 
Ref. 1^^, taking into account all uncorrelated and correlated errors: 

\Vub\ = {AMt^oilistat + det)tlil{b ^ c)t^oit{b ^ n) ± 0.02(r;,) ± 0.19(HQE)) x lO'^. 
The theoretical uncertainty from the Heavy Quark Expansion (HQE) is a combination of 
three sources: the neglect of higher terms including 1/m^, the uncertainty in the b-quark 
mass rrib and from perturbative corrections. The extraction of \Vub\ from the inclusive 
semileptonic BR(6 — > ui~Ui) relies on the validity of quark-hadron duality. Although 
quark-hadron duality can not be expected to be exact, there are good reasons that inclu- 
sive semileptonic decays of beauty hadrons are described quite accurately by HQE 
However, since the analyses have to apply cuts in order to suppress the background from 
6 — > c transitions only a part of the total semileptonic rate is measured which could lead 

The determination of |Ki6| from the lepton endpoint spectrum, obtained by ARGUS and CLEO |35| , 
suffers from large model dependencies |3^ ^ . In addition, possible violations of quark-hadron duality 
might be enhanced in this small part of the phase space. Hence, these results were not taken into account for this 
analysis. 



3C, 31 1, and exclu- 
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to sizable effects from quark-hadron duality violation which are difficult to quantify. In 
the case of the inclusive measurements we combine the theoretical uncertainties from HQE 
and the large systematic uncertainties due to 6 — > c and 6 — > u transitions by adding them 
in quadrature and obtain: |T4b| = (4.04 ± 0.44exp ± 0.54inodci) x 10~^. 

For the combined result of inclusive and exclusive measurements we obtain: \Vub\ = 
(3.49 ± 0.24 ± 0.55theo) x 10~^ where only the first error was used for the weighted mean. 
The maximum of both single ranges was assigned as the final systematic theoretical un- 
certainty. 

\Vcd\ '• Both matrix elements, \Vcd\ and \ Vcs\, can be determined from di-muon production 
in deep inelastic scattering (DIS) of neutrinos and anti- neutrinos on nucleons. In an 
analysis performed by CDHS \Vcd\ and \Vcs\ are extracted by fitting the data of 

three experiments, CDHS El, CCFR pi and CHARM II M], giving: x = 



(4.63 ± 0.34) X 10-3, where = 0.0919 ± 0.0094 ||, ||, g^] is the weighted average of 
semileptonic branching ratios of charmed hadrons produced in neutrino- nucleon DIS. This 
results in: \Vcd\ = 0.224 ± 0.014 



|V^s| : Besides DIS, the matrix element \Vcs\ can be obtained from D^s decays, charm 
tagged W decays and hadronic W decays. 

The average DIS result from CDHS, CCFR and CHARM II is k\Vcs\^Bc = (4.53 ± 0.37) x 
10"^ where k = 0.453ib0.106l^Qggg is the relative size of strange quarks in the sea compared 
to u and d resulting in \Vcs\ = 1-04 it 0.16 |l^ 



Similar to K^s, decays \ Vcs \ can be also extracted from DgS decays. However, the theoretical 



uncertainty in the form factor calculation /i(0) = 0.7 it 0.1 |48] limits the precision: 



Vcs\ = 1-04 lb 0.16 |18|, in perfect agreement with \Vcs\ from DIS. 



Under the assumption that unitarity holds for three families, the ratio Rc = T{W~^ — > 
cq)/T{W+ hadrons) = 'Ei=d,s,b\^ci\'^ /{'Ei=d,s,b;j=u,c\Vji\'^) for W decays is expected to 
be 1/2. The results of all LEP2 experiments are consistent with this expectation and give 
\Vcs\ = 0.97 ± 0.09stat ± 0.07sys I|fo|, |4|, |5^, |3, |2|. The ratio of hadronic and leptonic 



W decays measured by LEP2 provides the tightest bound on \Vcs\ if unitarity for three 
families is assumed: Rc = T{W~^ — > hadrons/r(iy"'" — > leptons) = J2i=d,s,b;j=u,c\^ji\ 



|2 



X 



[l + as{mw)/T^)- From the four LEP experiments \Vcs\ = 0.989±0.016 [jTo| is found where 
the errors on the single measurements are dominated by statistical errors^. 

Very recently, the OPAL collaboration has presented a new direct determination of \Vcs\ 



from W —>■ XcX resulting in |Vcs| = 0.969 ± 0.058 ^3|, which we use in the fit 



I V^bl : In the Wolfenstein parameterization, \ Vcb\ determines the parameter A the precision 
of which plays an important part for the constraints |T4b/Vcb|, \eK\ and Am^. It is obtained 
from exclusive B D^*^!?^ and inclusive semileptonic b decays to charm, b — > (.'UiXc, 
both measured by CLEO and the LEP experiments. The theoretical framework for ex- 
tracting numerical values for \Vcb\ from the measured decay rates is Heavy Quark Effective 
Theory (HQET) |5j] for exclusive measurements and HQE [^] for inclusive measurements. 



The exclusive results are given in the form J^D*{^)\Vcb\, so that they must be divided by 



The measurement of Rc should be used in the fit, rather than the quoted |Vca| determination which is derived 
from it. This piece of information is not used here, for the sake of simphcity. 
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the value of the Isgur-Wise function at zero-recoil. The form factor at zero-recoil, J'-£)*(l), 
is 1 in the heavy quark limit. For finite quark masses the corrections can be calculated in 
HQET. There are open discussions in the literature concerning the calculation of the l/m-g 
corrections for J^£)*{1). Here we use the value = 0.913 ±0.042 g for which several 

references have been taken into account. In the future, the most accurate determinations 
of are expected to come from lattice QCD. A first result with a quite small error 

reads = 0.935 ± 0.030 [|6|. The results from LEP and CLEO read 



^D-(l)|Kb|LEP 
!Fd* (l)|K:f)|cLEO 



cb\ 



0.0350 ± O.OOOTstat ± O.OOlSsys 
0.0424 ± O.OOlSstat ± 0.0019sys 
0.0373 ± 0.0013 
0.0409 ± 0.0014 ± 0.0019theo- 



m |7| 

m 



The combined fit of the CLEO and LEP numbers results in a confidence level of 7%. 

The theoretical error for the inclusive measurement contains the uncertainty in the ki- 
netic energy of the b-quark inside the b-hadron and uncertainties from perturbative 
corrections, the b-quark mass and the neglect of higher order terms in the l/m^ expansion 
including 1/m^ terms [^]. As in the case of the inclusive determination of V^t possible 
violations of quark-hadron duality could imply sizable effects. Future experimental inves- 
tigations should aim to shed more light on this topic. The most recent inclusive results 
read 



|V^b|LEP 
iKfelcLEO 



0.04076 ± 0.00050exp ± 0.00204thco 

0.041 ± O.OOlOstat ± 0.0020sys ± 0.00205theo 




The weighted mean of exclusive and inclusive results is \Vcb\ = (40.76±0.50±2.0theo) x 10~^, 
and is dominated by the inclusive measurement. In light of the controversial experimental 
and theoretical situation in the exclusive sector and possible violations of quark-hadron 
duality the theoretical uncertainty was not further reduced. 

\Vtb\ '• Assuming unitarity for three families, one obtains \Vtb\ from the ratio of the bottom 
quark production in top decays to the total top decay width: \Vtb\ = 0.99 ± 0.15 [62|. The 



unitarity assumption, here explicitly used, could be removed. However, owing to the poor 
precision presently achieved, we do not use this measurement in the global CKM fit. 

I VtsVtb/Vcbl : The inclusive ratio of 6 ^ S7 to 6 — > c£~P^ production provides a measure 
of the third row CKM elements. Present accuracy is only fair: | Vf^V^fc/V^bl = 0.93 it 0.14 it 
0.08 1 63, 34, Hence, the value is not used in the fitting procedure. 



6.2 CP Observables and Mixing 

Constraints on the CKM phase are obtained by the CP-violating observables in the — 
and — systems, and by B'^ — B^ and B'^ — B^ mixing. 

• \^k\ '• Indirect CP violation in the — system is measured by 

= \'^+- + ^^00 , (63) 

with rjj^ (%o) being the ratio of the amplitudes of the long-lived and short-lived neutral 

kaons decaying into two charged (neutral) pions. They have been measured to an accuracy 
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of 1% [gg, 67, 65, The averaged value of Ref. is used in this analysis. Within 
the SM, CP violation is induced by AS = 2 transitions owing to box diagrams. It can 
be related to the CKM-matrix elements by means of the vacuum insertion approximation, 
used to determine the hadronic matrix element 



(64) 



Neglecting the real part of the non-diagonal element of neutral kaon mass matrix M12 , one 
obtains 



\(^k\ 



GjpmfymKfK 



Bk ( r/cc5(xe, Xe)Im[(T4,K*J^] + mtS{xt, xt)lxn[{VtsV:d?' 



+ 2r/rf5(xe,Xt)Im[y,,Kd^ts^;d] 



(65) 



Here, the S{xi the Inami-Lim functions 



fl 9 

S{x) = S{Xi,Xj)i=j = ^(^"'"^Q 



4(1 -x) 2(1 -x)V 2 Vl-a; 



S{xi 



+ 



4 2(1 -Xi) A{l-Xifjxi-Xj 
3 1 

+ [Xi ^ Xj) ^ 



ln(x) , 



-ln(xi) 



(1 - Xi){l - Xj) 



(66) 



depending on the masses of the virtual charm and top quarks in the box diagrams (xj = 
mf/m^y). The QCD corrections to the Inami-Lim functions have been calculated to next- 
to-leading order: ry^c = 1-38 ± 0.53 Q, r]tt = 0.574 ± 0.004 [^] and rjct = 0.47 ± 0.04 Q 
(for a compendium see also |7l|]). The kaon decay constant has been extracted from the 
leptonic decay rate: fx = (159.8 ± 1.4|v„,| ± 0.44theo) MeV 0. The Ks - Kl mass 
difference is known with excellent accuracy, Amx = (3.4885 ± 0.0008) x 10~^^ GeV p^ . 



The main uncertainty in Eq. (|6^) originates from the bag parameter Bk which cannot 
be measured but has to be predicted by theory. The most reliable calculations of Bk 
are supposed to come from lattice QCD. Currently, these calculations are performed only 
under the assumption of SU{3) symmetry using the quenched approximation, i.e., using 
quarks with infinite masses and neglecting the contribution of sea-quarks in closed loops, 
which leads to a substantial reduction in computing time. The world average reads: Bk = 
0.87 ± 0.06 lb 0.14quench where the first error combines statistical and accountable 
systematic uncertainties while the second stands for an estimate of the error introduced 
by the quenched approximation and SU{3) breaking effects. 



e'/^KT : In terms of the neutral kaon amplitude ratios r/+_ 
good approximation 

Re(e7ei^) = ^(1 - |7/oo/??+-P) 



and 7/00, one finds to a very 



(67) 



The first evidence of direct CP violation in the neutral kaon system has been found by the 
NA31 Collaboration at CERN |76| which was not confirmed by the E731 Collaboration 
Since then, measurements at KTeV [ [f^ ] and NA48 [79| have obtained significant positive 
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results while some inconsistencies about the central value remain. The experimental values 



are 




(NA31 [76]) 

T5 M/ ^ Tn4 J 7.4 ± 5.9 (E731 W%) 

(NA48 

for which the weighted mean of (Re(e'/ex)) = (19.2 it 2.5) x 10~^ has a consistency of 
X^/3 = 3.5. The experimental situation being somewhat inconsistent, the theoretical 
prediction in the framework of the SM is still under strong investigations. The basic 
expression is of the form [S0| 

Re(e7ej^) = Im [V*VtdV:,Vud] S {Bq, Bs,ms{ss)) (69) 

with S(. . .) being a function of the hadronic matrix elements, Bq and Bs, of the dimension- 
6 and dimension-8 non-perturbative power corrections which contribute to the effective 
Hamiltonian. The quantitative size of these operators, in particular Bq, and the exact 



ingredients of S(. . .) are still under consideration |80|. 



As a consequence, we shall not use e' /ex in the current analysis. 

Am^ : The frequency of B^ — B^ oscillation is given by the mass difference, Am^, between 
the two B^ mass eigenstates, B^ and i?L- It has been measured to an accuracy of 3% ^ 
(see table |^). In analogy to {exl, B^ — B^ oscillation in the SM is driven by effective flavor- 
changing neutral current (FCNC) processes through /S.B = 2 box diagrams. In contrary 
to |ei<-|, where the large hierarchy in the Inami-Lim functions is partly compensated by the 
CKM matrix elements, the A.B = 2 box diagrams are dominated by top quark exchange 
between the virtual boson lines. This simplifies the theoretical prediction which then 
reads 

^2 



^^d = -^TlBmBjBiBdmy^S{xt)\VtdVtb\ , (70) 
where rjs = 0.55ib0.01 (for a review see ]71]) is a correction to the Inami-Lim function S{xt) 



(see Eq. (|66D ) from perturbative QCD. The leptonic decay constant /b^ has not been mea- 
sured so far and, like the bag parameter Bd, has to be determined by theory, in particular 
lattice QCD. Up to now, calculations are mainly performed in the quenched approximation 
where the different groups find consistent results. The most recent world averages for the 
decay constant is fs^ = (175 ± 20) MeV [^] where the error includes statistical and ac- 
countable systematic uncertainties. The unquenched result is estimated to be about 10% 
higher than the quenched value, fs^ = (200 ±231^^) MeV |8^, |^]. Recently, first (partly) 
unquenched calculations with two degenerate sea quarks were published and are, within 
the given uncertainties, in agreement with the expected increase [p5| , |8^ . The world aver- 
age for the bag parameter in the quenched approximation is Bd = 1.30ib0.12ib0.13, where 
the second error is the estimated uncertainty due to the quenched approximation [p^]. In 



the present work we use fsaVBd = (230 it 28 it 28) MeV [83|, where the second error has 
been symmetrized. 

ArUg : Although Arris itself has only a weak dependence on the CKM phase the ratio 
Arus/ Amd introduces a strong constraint since the dependence of the SM prediction on the 



The consistent, though preliminary measurements of BABAR ^] and Belle js^, were not considered in the 
average quoted in table hi. 
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Figure 5: Preliminary average of the experimental amplitude spectrum using results from 
ALEPH, DELPHI, OPAL, SLD and CDF jg^/ . The dashed and dotted lines give the sensitivity 
(^1.645 X a a) and the 95% limit A+ 1.645 x aA), respectively. Shown in addition are the distri- 
butions obtained from a toy Monte Carlo simulation using as true values Anig = 8, 17, 50 ps""*^. 



parameters tjb and mt cancel in the ratio. Furthermore, the ratio = fBg^/Bs/fBaVBa 
can be calculated more reliably from lattice QCD than fsaV^d alone since most of the 
systematics cancel. For ^, we are combining the average values for quenched calculations 
from Refs. ^ and choose: ^ = 1.16 ± 0.03stat,sys i 0.05quench- 

Limits on — oscillation governed by the mass difference Am^ have been obtained by 
ALEPH m, DELPHI [||, OPAL H, SLD |o| and CDF |l|. A convenient approach to 
average various results on Am^ is the Amplitude Method |9^] (see also the exhaustive study 
in Ref . |^3| ) , which consists of a likelihood fit to the measured proper time distribution with 
the amplitude of the oscillating term being the free parameter at given frequency Am^. 
Fig. m shows the average of the measured amplitude spectrum |^4| with the expected 
spectra for different true Am^ superimposed. The latter have been obtained, following the 
prescription of Ref. , from a toy simulation in which the decay length and momentum 
resolutions are tuned to reproduce the measured errors on the amplitudes (the RMS of 
the relative difference between measured and simulated errors is smaller than 2% in the 
relevant sensitive region of Attt-s). Shown in addition are the experimental sensitivity 
defined as 1.645 x aA for a given Arris and the 95% CL limit which is given by the sum of 
the sensitivity and the measured central amplitude. A sensitivity of 18.0 ps~^ and a lower 
limit for Am^ of 14.9 ps"^ at 95% CL is obtained H]. 
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The information from — oscillations is usually implemented into fits using 



-A\ ~ ((^ ~ and CL(x^-^_^|) = Erfc(|l — A|/(T^/a/2)). However, this procedure 



does not properly interpret the information of the amplitude spectrum. For instance, two 
measured amplitudes Ai and A2, where Ai > 1 and A2 < 1 but Ai — 1 = 1 — A2, result in 
the same confidence level in this approach although it would be natural to assign a larger 
likelihood for an oscillation to Ai than to ^2- We propose an alternative procedure which 
exploits the information from the sign of 1 — ^ by suppressing the module in the above 



definition of 



X\i-A\'- 



xi-A 



2 • 



Erfc" 



iErfcfi^) 
2 \V2aAJ 



n 2 



(71) 



It has been pointed out that the maximum information from the fit to the proper time dis- 
tributions of mixed and unmixed B^{B2) decays is obtained from the ratio of the likelihood 
at given frequency Arus, C{Ams), to the likelihood at infinity, £(Ams 
The logarithm of this ratio reads 



00 



95|,|. 



2Aln£°°(Am, 



{1-Af 



0"1 



(72) 



which is assigned via Eq. ( p^ ) to x^- The behaviour of the above defined 'x^ and likelihood 
functions versus Am-s for the measured amplitude spectrum is plotted in Fig. |6[ The 
dashed line shows the drawback of using xfi_A|- ^he strongest signal yield is obtained at 
the crossings A = 1 and not at the maximum amplitude situated around 17 ps~^. This 
drawback is cured for Xi-a (solid line). The dotted line shows the ratio providing 
a significantly stronger constraint. For the current analysis we decided not to use the 
likelihood ratio since the validity of the normalisation of the likelihood which allows to 
identify C°° with a probability density function is questionable. This problem could be 
circumvented by means of a realistic Monte Carlo simulation which permits the conversion 
of likelihoods to confidence levels which however is currently not available. 

sin2/3 The first measurements for sin2/3 in B decays to CP eigenstates containing char- 
monium from the B factories, CDF and LEP give results which are compatible with both, 
the SM expectation and zero: 



sin2/3 : 



0.34 ± 0.21 (BABAR) [96]) 

0.58 ± 0.34 (Belle) M) 

0.79 ± 0.43 (CDF) [m) 

0.84 ± 0.93 (ALEPH) [agf) 

3.2 ± 2.0 (OPAL) [100|) 



(73) 



From these measurements (asymmetric errors have been averaged) we obtain the weighted 
mean sin2/3wA = 0.48 it 0.16, where the (small) effect from using different Am^ values in 
the single analyses has not been taken into account. 



6.3 Future Prospects: Rare Decays of K and _B-Mesons 

Theoretically clean measurements of CKM matrix elements are obtained by virtue of rare K 
and B decays. The countours in the (p, fj) plane expected from rare K decays are drawned on 
Figg 
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Parameter 



Value lb Error(s) 



Error treatment in i?fit: 
Reierence(s) „ „ Float. 

Gauss, ineo. rrop. 



Vud\ 

Vus\ 

Vub\ 

Vcs\ 



0.97394 ± 0.00089 
0.2200 ± 0.0025 
(3.49 ±0.23 ±0.55) x lO'^ 
0.224 ±0.014 
0.969 ± 0.058 
(40.76 ± 0.50 ± 2.0) x lO'^ 



see text 
see text 
see text 
see text 
see text 
see text 



Arud 

Arrig 

sin2/3wA 



(2.271 ± 0.017) X 10-°3 
(0.487 ± 0.014) ps-i 
Amplitude spectrum 
0.48 ±0.16 



see text 
see text 
1 94 1, see text 
see text 



mt(MS) 
niK 

mw 

Gf 

Ik 



(1.3 ±0.1) GeV 
(166.0 ± 5.0) GeV 
(493.677 ± 0.016) MeV 
(3.4885 ± 0.0008) x lO'^^ GeV 
(5.2794 ± 0.0005) GeV 
(5.3696 ± 0.0024) GeV 
(80.419 ± 0.056) GeV 
(1.16639 ± 0.00001) X 10"^ GeV" 
(159.8 ± 1.5) MeV 



Bk 
rjcc 

r]tt 

r?B(MS) 



0.87 ±0.06 ±0.13 
1.38 ±0.53 
0.47 ±0.04 
0.574 ± 0.004 
0.55 ±0.01 
(230 ± 28 ± 28) MeV 
1.16 ±0.03 ±0.05 



see text 
see text 
see text 
see text 
see text 
see text 
see text 



Table 1: Inputs to the global CKM fit. If not stated otherwise: for two errors given, the first 
is statistical and accountable systematic and the second stands for systematic theoretical uncer- 
tainties. The fourth, fifth and sixth columns indicate the treatment of the parameters within 
i?fit; measurements dominated by experimental errors (or statistical components of theoretical 
parameters) are marked as "Gauss." by an asterisk; parameters dominated by systematic the- 
oretical uncertainties, treated as ranges in R&t, are marked as "Theo. "; for parameters that 
have experimental and systematic theoretical errors, treated in the fit according to Eq. (61_), 
both fields, "Exp. " and "Theo. ", are marked; parameters with small errors, marked as "Prop. ", 
have their uncertainties propagated to the corresponding measurements to whose errors they are 
added in quadrature. The last column indicates whether or not the parameter is floating in the 
fit. In general, measurements with non-vanishing systematic theoretical errors have a floating 
theoretical component. Theoretical parameters with significant errors are necessarily floating. 
Upper part: experimental determinations of the CKM matrix elements. Middle upper part: CP- 
violating and mixing observables. Middle lower part: parameters of the SM predictions obtained 
from experimental data. Lower part: parameters of the SM predictions obtained from theory. 
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Am^ [ps ] 

Figure 6: The measured (upper plot) and likelihood (lower plot) functions (defined in the text) 
versus the frequency of — oscillation, Am^. 



The decay — tv^vv has not been observed yet. The current upper Umit reads 



BRiKl 



TT^ui?) < 5.9 X 10"'^ 



(CL = 90%) 



(74) 



while the expected SM branching ratio is of the order of 2 x 10 The decay proceeds via 
a loop induced FCNC transition at short distance and is greatly dominated by a direct 
CP-violating amplitude in the SM |To|], A{Kl ir^ui?) oc lm[VtdVil]{7r^\{sd)v-A\K^) , 
due to the cancellation of the charm contributions. The SM prediction for the branching 
fraction of the decay reads [|102|, 



Here, rx^^ = 0.944 corrects for isospin breaking effects and the different phase space [104] 
involved in the relation between the and the branching fractions. The other 



parameters in Eq. (75) are the kaon lifetimes, the QED running fine structure constant 
and the Weinberg angle. The Inami-Lim function X{xt) for xt = {mt/mw)'^ is defined as 



Xo{x) 



X + 2 3x 
+ 



6 



8 V^;- 1 (x- 1)- 



ln(x] 



(76) 



for which r]x = 0.994 (Eq. (^)) represents the NLO correction llO|]. In Ref. Il0|] the CP 
conserving contribution to i^L ~^ t^^vv has been found to be suppressed by a factor of 6 x 
10~^ with respect to the CP-violating rate. Expressed in the Wolfenstein parameterization, 
the SM prediction corresponds to 



(77) 
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showing that a measurement would provide a pair of horizontal lines in the (p, r]) plane. 
Proposed experiments that could measure BR(/Cl t^^vv) are the KOPIO experiment 
at BNL |0|] and KAMI at FNAL [|lO^ expecting 60 and 120 events, respectively. The 



experiment(s) will not start before 2005 and have to take data for several years. The 
expected precision on the branching ratio in 2010 is of the order 5% to 10%, thereby 
yielding a precision of a few percent on the measurement of f/. 

— y 7T~^i/v : The SM prediction of the rare decay —>■ tt^uu is given by 



3a2 BR{K+ 7r°e+j/) 



BR(K+ ^ TT^uu) = tk^^ „ . ; " ^ ' \vxXo{xt)VtdV:, + X^^^V,dV, 



(78) 

where the function XQ{xt) is given by Eq. ( [76| ) and where the -^^L terms account for the 



charm contributions (not suppressed here) and are calculated in Ref. jl08| . The values 
depend on the QCD scale and the running charm quark mass: ^^'l ~ ~ ^ 10~^ 
and = (5 — 9) X 10^^. A detailed discussion of the theoretical uncertainties connected 



with this and the above SM predictions is provided in Ref. |103| ]. The BNL experiment 
E787 has observed one event resulting in BR(i^+ vr+z^P) = (1.5t5;|) il09| ]. 



About 5-10 events are expected to be observed by the successor E949 |110|] whereas the 



CKM project at FNAL | 111 |, starting about 2005, expects to collect about 100 events 
within some years of data taking. A similar precision for the branching ratio measurement 
as in the case of — > tt^vv may be achieved in the year 2010. A theoretical uncertainty 



of the order 5% but likely not well below this value might be possible [|103(| . 

The — ^ T^v decay has not been observed yet. The current upper limit for its 



branching fraction reads [10| 



BR(5+ ^ T+v) < 5.7 X 10"'' (CL = 90%) . (79) 
In the SM the branching ratio is given by 

BR(5- ^ r-u) = (1 - ^) , (80) 

with the B meson decay constant fs^ (see Tab. |l]) and the lifetime of the charged B, 
tb = 1.653 ± 0.028 ps |jl^. Depending on the precision of the lattice calculation of fs^, a 
measurement of BR(i?+ t^v) may either yield a direct measurement of \Vub\, or may 
improve the prediction of Am^ through the constraint of /b^ which is the more likely way 
to proceed. Additional information may be obtained by measuring the radiative decay 
B~^ — > i^L'i'y in which the helicity suppression is circumvented due to the emission of the 
photon from the primary u-quark (see, e.g., Ref. [|112f| ). Although the calculation of the 
branching ratio is model dependent a measurement possibly provides a useful experimental 
check of lattice calculations. 

CP-violating Partial Rate Asymmetries (PRA) of inclusive b s{d)'y decays can be 



calculated in the SM |113, nT4, nm, Wm, 117]. They are defined by the ratio 



CP BR(5 - X,(,)7) + BR(5 - X,(,-)7) ^ ^ 
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Their theoretical predictions depend on various Wilson coefficients and CKM matrix el- 



ements involving the CP-violating phase |115| . Lumping all coefficients together, where 
external parameters like the strong coupling constant, the 6-quark mass, the photon in- 



frared cut-ofF and the renormalization scale have to be fixed, gives the estimate [ 117 | 



^CP 



This yields asymmetries of ^^p'^''^ 
the CKM elements. 



0.33 X Im 



0.6% and A' 



ts{d) . 



(82) 



CP 



-16% for some typical values of 



B ttK : Due to the work of many authors (see, e.g., Refs. 1TT|, |rT|, |l2g, |T2l|, |l2|] 
- this list is far from being complete), it could be shown that the (ratios of) branching 
fractions of charmless decays into vr and K final states provide constraints on the UT 
angle 7. Most recent branching ratios read 

(83) 
(84) 
(85) 
(86) 
(87) 

where the values given are the weighted means of the preliminary results on charmless B 



BR(i?o - 


vr+TT") +BR(50 - 


vr^TT ) 


= (4.43 ± 0.89) X 10^^ 


BR(S° ^ 


K+TT-) + BR{Bj - 




= (17.25 ± 1.55) X 10~^ 


BR{B+ - 


> K+7r°) + BR{B~ - 


K-n^) 


= (12.10 ± 1.70) X 10^*^ 


BR(B+ - 


> A"°7r+) + BR(S~ - 




= (17.19 ± 2.54) X 10~^ 


BR(SO 


^ K\^) + BR{Bj - 




= (10.33 ± 2.53) X lO"'' 



decays presented by the BABAR, Belle and CLEO collaborations [123, 124, 125 1 (asymmet^ 



ric errors have been averaged). The authors of Refs. [12C, 121] have obtained predictions 
of relative amplitudes and phases of the tree and penguin diagrams involved in the above 
decays. Very recently, a new theoretical analysis of two-body B decays to pions and kaons, 
based on non-leading Factorization Approximation, has been published [12G]. The authors 
obtain an allowed region for |l^fe|e~*^ which is in agreement with the results found in this 
work. 

A statistical discussion and formulae for the treatment of ratios of branching fractions or, 
more precisely, constraints from parameters with arbitrary absolute, but known relative 
normalization is given in Appendix ^ of this paper. 



We have attempted in this section to recall some of the most striking prospects for future 
CKM constraints which, however, is far from being complete. A more quantitative and broader 
selection of CKM sensitive quantities as well as extrapolations into the future can be found in 
Ref . 1 14 1 . If Atti-s is not much larger than suggested by the current SM constraints it will likely 
be measured during the forthcoming Tevatron II run. The precision of the combined Am^ and 
Attt-s constraints on /), f) will then be dominated by the QCD parameter ^. An experimental 
determination of the decay constant ratio /d^//d<j at a r /charm factory would be helpful to 
check lattice QCD calculations of ^. Measurements of time-dependent CP-asymmetries at the 
T(4S') and at hadron machines, in particular at the forthcoming experiments LHCb and BTeV, 
aim to extract the UT angle a in non-strange, charmless two and three body decays. The 
remaining angle 7 is expected to be determined in B^ DK and Bg — > DgK decays, though 
these measurements require very large data samples of the corresponding -6^(5) mesons. The 
dedicated experiments LHCb and BTeV will also measure the most promising channel Bg — > 
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7 Constrained Fits Within the SM 



After the discussions in the preceding sections, we are prepared to perform the constrained fits 
of the CKM parameters and related quantities. We place ourselves in the framework of the i?fit 
scheme (see Section |^ for an introduction and Section 4.1 for a summary of its main features) 
and hence define the theoretical likelihoods of Eq. (32) to be one within the allowed ranges and 
zero outside^^ As a consequence, no hierarchy is introduced for any permitted set of theoretical 
parameters, i.e., the which is minimized in the fit receives no contribution from theoretical 
systematics, but theoretical paramaters cannot exit their allowed ranges. When relevant, sta- 
tistical and theoretical uncertainties are combined beforehand, as presented in Section (i.e., 
applying Eq. (|39|)). Floating theoretical parameters are labelled by an asterisk in the "Float." 
column of Table For parameters with small uncertainties, errors are propagated through the 
theoretical predictions, and added in quadrature to the experimental error of the corresponding 
measurements^: they are labelled by an asterisk in the "Prop." column of Table |l]. 



7.1 Two Dimensional Parameter Spaces 

It is customary to present the constraints on the CP-violating phase in the two-dimensional 
(p, ff) plane of the Wolfenstein parameterization. Other representations involving the UT angles 
a, (3 and 7 are also considered in the analysis. For the two-dimensional graphical displays we 
define the a parameter space by the coordinates a = {e.g., a = {p, f)]) and the p space 

by the other CKM parameters A and A, as well as the yqcD parameters. 



7.1.1 Metrology in the (p, rf) Plane 

The individual constraints, sensitive to p and f), are drawn in Fig. |^. Shown are the CLs of 
Eq. (^) which, according to the frequentist approach adopted in Mii, have to be interpreted as 
upper hounds for the optimal set of theoretical parameters at a given point in the {p, ff) plane (this 
is implicit in the following when invoking the term CL). Obviously, CLs should not be interpreted 
as PDFs, i.e., inferring equal relative probability density from equal shades. Instead, a CL value 
expresses a probability which is defined for a given coordinate {p,ff\: it is the probability that 
the agreement between data and the most favorable realization of the SM at that point be worse 
than the one observed. However, although the CLs have a well defined statistical meaning, one 
must be aware of their strong dependence on the, to some extent, arbitrary [yqco] ranges, (c./.. 
Section |3. 



The results of the global fit in the (p, ff) plane are shown in Fig. ^ not including (upper plot) 
and including (lower plot) in the fit the world average of sin2/3 (see Table |l]). The dark, medium 
and light shaded areas correspond to > 90%, > 32% and > 5% CL, respectively. The outer 
regions with lower probabilities are outside the yj^^ domain where an adjustment of the p 
parameters can maintain maximal agreement {i.e., can reproduce the Xmin j/mod '^^1^^)- Also 
shown are the 5% CL contours of the individual constraints as well as the > 32% and > 5% CL 



regions corresponding to sin2/3wA (hatched areas). As described in Section 3.2, the CLs obtained 



belong to the metrological phase of the analysis and, by construction, do not constitute a test 

Lmin;ymod 



of goodness of the theory. A probe of the SM is obtained from the numerical value of Xn 



as discussed in Section p. 3 and used in Section 7.3 



In other words, we use k — Q and C = 1 for the Hat function £syst(a^o) of Eq. 
■^^ This procedure neglects the correlations occurring when such parameters are used in more than one theoretical 
prediction. 
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Figure 7: Confidence levels in the {p, f)) plane for the individual constraints. The upper right 
hand plot shows in addition to Am^ the improved constraint from Am^ via on Am^. The 
shaded areas indicate the regions of > 90%, > 32% and > 5% CLs, respectively. 



7.1.2 Other Two Dimensional Parameter Spaces 

Except for possible multi-valuedness problem^, it is straightforward to replace the {p, ff) plane 
by any other one, two or higher dimensional parameter constellation. Figures ^ and 10 show 
the results from the global fits and for the individual constraints in the planes (sin2Q, sin2/3), 
(sin2a, 7) and (sin2/?, 7), respectively. The constraint from sin2/3 does not enter the fits. As 
aforementioned, the individual constraints are given as 5% CL contours, and the shaded areas 
depict > 90%, > 32% and > 5% CL areas. In general, for the (p, ff) plane as well as for 
any parameter spaces, the individual inputs are less constraining than what they yield in the 
combined fit: i.e., a combination of input variables can lead to a suppression of solutions thus 
enhancing the individual constraints. An example for this is drawn in the lower plot of Fig. 
the individual contribution of \eK\ is stronger in combination with iK^b/VJ;;,! (indicated by the 



arrows). The lower plot of Fig. 10 visualises the complementarity between l^h/V^fel and \€k\ 



constraining sin2/3, on one hand, and Am^/Amd and \eK\i constraining 7, on the other hand. 



7.2 One Dimensional Parameter Spaces 

Following the line of the preceding sections we can derive one-dimensional constraints for all 
parameters involved, such as the various CKM parameters, the moduli of the CKM matrix 
elements, branching ratios of rare K and B meson decays as well as theoretical parameters. 
Consequently, we define the parameter we are interested in to be a and all others to be 
(c./.. Section 3.2), and scan a. Numerical and graphical results are obtained for CKM fits not 



including (including) the world average value of sin2/3 (see Table |l| for the input parameters). 



For example, when exploring the (sin2Q:, sin2/3) plane, care should be taken to account for multiple solutions. 
A given value of sin2a; {lu — a or (3) can be obtained with four values of uj [uji — iarcsin(sin2a;), i02 = ^ — uji, 
t03 = TV + UJ1, UJ4 = ^ — LOi) and corresponds to a pair of curves in the (p, -q) plane intersecting on the = axis. 
Each intersection of one of the sin2/3 curves with one of the sin2a curves should be considered. 
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-1 -0.5 0.5 1 

P 




Figure 8: Upper plot: confidence levels in the {p, ff) plane for the global CKM fit. The shaded 
areas indicate the regions of > 90%, > 32% and > 5% CLs, respectively. Also shown are the 
5% CL contours of the individual constraints. The > 32% and > 5% CL constraints from the 
world average of the sin2/3 measurements, not entering the combined fit, are depicted by the 
dashed areas. See Table |^ for a compendium of the fit input values. Lower plot: confidence levels 
obtained when including the world average of sin2/3 in the combined fit. 
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Figure 9: Confidence levels in the (sin2a, sin2/3) plane for the global CKM fit. The shaded areas 
indicate the regions of > 90%, > 32% and > 5% CLs, respectively. Also shown are the 5% CL 
contours of the individual constraints. The > 32% and > 5% CL constraints from the world 
average of the sin2/3 measurements, not entering the global fit, are given by the dashed areas. 



As an example, Fig. 11 shows the CLs obtained for the Wolfenstein parameters, the UT angles 
and the Jarlskog parameter, without (solid line, gray area) and with (dotted line) including 
the world average sin2/3wA in the global fit. As in the two-dimensional case, the CLs shown 
correspond to the most compatible theory for a given point in o. Since the parameter A is not 
significantly constrained by the other inputs, its CL corresponds to the error function for one 
degree of freedom. In contrast, parameters such as, e.g., p, f], are constrained by observables 
whose SM predictions are dominated by systematic theoretical errors. The positions of the 
flanks of the CL functions are determined by the [i/qcd] ranges, whereas their sharp rises are 
determined by statistical errors. Therefore, one should not attribute an absolute meaning to 
the precise locations of the flanks: they are due to the assumptions made to define the [i/qcd] 
ranges. In particular, when the world average sin2/3wA is used, the CL function obtained (c./.. 
Fig. |ll|) exhibits a triangular shape: whereas the fall off on the right hand side of the function 
is well defined, the location of the fiank on the left hand side is somewhat arbitrary and hence 
arguable. 

The results for all relevant parameters considered in this work are listed in Table without 
sin2/3 in the fit, and Table |^ when including sin2/3wA- Given are the ranges for > 32% and > 5% 
CLs in the case of theoretically limited quantities and the corresponding Gaussian errors in the 
case of experimentally limited quantities. The 95% CL allowed ranges for the CKM matrix 
elements are similar to the ones quoted by the PDG. Numerical results involving B — > mr/KTr 
decays are not presented here^. 



It proved however straightforward to implement them into the -Rfit scheme of the CkmFitter package (see 
also the discussion in Section 3.3). 
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Figure 10: Confidence levels in the (sin2a, 7) (upper plot) and the (sin2/?, 7) plane (lower plot), 
obtained from the global CKM fit. The shaded areas indicate the regions of > 90%, > 32% and 
> 5% CLs, respectively. Shown in addition are the 5% CL contours of the individual constraints. 
The > 32% and > 5% CL constraints from the world average of the sin2/3 measurements are 
given by the dashed areas. It does not enter the global fit. 
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Parameter 




> 32% CL 


half width 


> 5% CL 


half width 


A 




0.2221 ± 0.0021 


0.2221 ± 0.0041 


A 




U. ( /U - U.ooo 


U.UiDy 


U. (04 - u.yuo 


U.U/D 


P 




0.08 - 0.35 


0.14 


0.04 - 0.38 


0.17 


V 




0.22 - 0.46 


0.12 


0.21 - 0.49 


0.14 


J (10-5) 




2.0 - 3.5 


0.8 


1.9 - 3.7 


0.9 


sin2a 




-0.91 - 0.34 


0.63 


-0.96 - 0.49 


0.73 






0.50 - 0.86 


0.18 


0.47 - 0.89 


0.21 


a 




80° - 123° 


22° 


75° - 127° 


26° 






15.0° - 29.7° 


7.4° 


14.0° - 31.4° 


8.7° 


7 = (5 




37° - 75° 


19° 


34° - 82° 


24° 


sin6'i2 




0.2221 ±0.0021 


0.2221 ± 0.0041 


sin^i3 (10-3) 




2.70 - 4.31 


0.81 


2.49 - 4.55 


1.03 


sin023 (10-3) 




38.4 - 43.2 


2.4 


37.9 - 43.6 


2.8 


\Vud\ 




0.97504 ± 0.00049 


0.97504 ± 0.00094 


IT/ 1 




0.2221 ± 0.0021 


0.2221 ± 0.0042 


\Vub\ (10-3) 




2.70 - 4.31 


0.81 


2.49 - 4.55 


1.03 


\V J 

\ ''cdl 




0.2220 ±0.0021 


0.2220 ± 0.0042 


\Vcs\ 




0.97422 ± 0.00056 


0.97422 ± 0.00102 


\Vcb\ (10-3) 




38.4 - 43.2 


2.4 


37.9 - 43.6 


2.8 


\Vtd\ (10-3) 




6.6 - 9.2 


1.3 


6.3 - 9.6 


1.6 


\Vts\ (10-3) 




37.7 - 42.8 


2.6 


37.3 - 43.2 


3.0 






0.99907 - 0.99927 


10 X 10-5 


0.99905 - 0.99929 


12 X 10-5 


A / 1 \ 

Anis (ps ) 




15.5 - 33.7 


9.1 


15.0 - 42.0 


13.5 


BR(i^{^ ^ TT^iyj?) 


(lo-^M 


1.3 - 4.0 


1.4 


1.2 - 4.4 


1.6 


BR{K+ -K+vu 


(10-11) 


5.1 - 9.6 


2.3 


4.8 - 10.5 


2.9 


BR(B+ ^ r+zy^) 


(10-5) 


4.6 - 20.0 


7.7 


3.6 - 23.6 


10.0 


BR 


(10-^) 


1.8 - 7.9 


3.1 


1.5 - 9.3 


3.9 


(MeV) 




193 - 271 


39 


184 - 284 


50 


Bk 




> 0.55 


> 0.50 




mt (GeV) 




106 - 406 


150 


93 - 565 


236 



Table 2: Fit results for the various CKM parameters, the CKM matrix elements, branching 
ratios of some rare K and B meson decays and theoretical quantities. Ranges are quoted for the 
quantities that are limited by systematic theoretical errors. The last three lines give the ranges 
obtained for chosen theoretical parameters when removing their respective bounds in the fit. 
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Parameter 






> 32% CL 


half width 


> 5% CL 


half width 


(532 


55 


A 






0.2221 ±0.0021 


0.2221 ±0.0041 








A 






0.782 - 0.888 


0.053 


0.758 - 0.906 


0.074 


10 


3 


P 






0.09 - 0.29 


0.10 


0.04 - 0.37 


0.16 


29 


6 


V 






0.22 - 0.32 


0.05 


0.21 - 0.42 


0.11 


58 


21 








2.0 - 2.9 


0.5 


1.9 - 3.5 


0.8 


oo 


w 


sin2a 






-0.88 - 0.04 


0.46 


-0.95 - 0.33 


0.64 


27 


12 


sin2/? 






0.50 - 0.67 


0.09 


0.47 - 0.81 


0.17 


50 


19 


(y. 






89° - 121° 


16° 


80° - 126° 


23° 


27 


12 


(3 






15.0° - 21.0° 


3.0° 


14.0° - 27.0° 


6.5° 


59 


25 


7 = 5 






42° - 74° 


16° 


34° - 82° 


24° 


16 





sin0i2 






0.2221 ±0.0021 


0.2221 ±0.0041 








sm6'i3 (10-3) 






2.70 - 4.03 


0.67 


2.49 - 4.38 


0.95 


17 


8 


sine23 (10-3) 






38.4 - 43.2 


2.4 


38.0 - 43.6 


2.8 








\Vud\ 






0.97504 ± 0.00049 


0.97504 ± 0.00094 








IK.I 






0.2221 ± 0.0021 


0.2221 ±0.0042 














2.70 - 3.71 


0.51 


2.45 - 4.38 


0.96 


O i 


7 








0.2220 ± 0.0021 


0.2220 ± 0.0042 








\v 1 






0.97414 ± 0.00049 


0.97414 ± 0.00098 


13 


4 


\Vch\ (10-3) 
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1.1 - 3.8 


1.4 


50 


13 


BR(K+ ^ vr- 




) (10 


-11) 6.6-9.5 


1.5 


5.4 - 10.4 


2.5 


35 


14 


BR(S+ ^ 




(10- 


5) 4.6 - 12.4 


3.9 
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(MeV) 
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26 


185 - 272 


44 


33 


12 


Bk 






> 0.72 




> 0.55 




31 


10 


mt (GeV) 






124 - 406 


141 


102 - 550 


224 


6 
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Table 3: Fit results including the world average on sin2/3wA- in Table ranges are given 
for the quantities that are limited by systematic theoretical errors. The two right columns give 
the relative improvements (in percent) of the > 32% CL and > 5% CL half widths with respect 
to the fit results without sin2/3 given in Tab. ^. The last three lines give the ranges obtained for 
chosen theoretical parameters when removing their respective bounds in the fit. 
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Figure 11: Confidence levels for one- dimensional parameter fits of the CKM parameters, UT 
angles and the Jarlskog parameter, not including sin2/? in the fits. The solid (dashed) lines give 
the results with (without) the CP-violating \€k\ as fit input. The fits corresponding to the dotted 
lines include sm2/3wA cind |e/^|. 



7.2.1 Indirect Evidence for CP Violation 

It is interesting to investigate the possibility of an indirect evidence for CP violation, from the 
measurements of non CP-violating observables. The dashed curves in Fig. |l^ give the CLs 
which are obtained when using neither sin2/3, nor {exl in the fits. For f/ = (hence no CP 
violation) i?fit yields CL ~ 50%. Therefore, we find that CP conservation cannot be excluded 
without sin2/3 or |ex|: a better knowledge of iT^ft/Kftl and Am^ (Am^) is needed to draw any 
further conclusions. The large value of CL(f/ = 0) stems from the fact that the quoted CLs are 
upper bounds. There exist realizations of the SM, with = and with all ygcD parameters 
within their allowed [yqcD] ranges, which provide a perfectly acceptable description of data 
(without {exl and sin2/3). The realizations of the SM which yield the best agreement are chosen 
to compute CL(f/ = 0). 

This result is not in qualitative agreement with the one obtained in Ref. |Q: this illustrates 
how widely different conclusions can be reached depending on the choice made for the statistical 
treatment. The Bayesian approach, while computing CL(f] = 0), is incorporating in passing, 
through the use of PDFs and Eq. (^6|) , the " volume" of the domain in the yqcD space (weighted 
by the theoretical PDFs) where realizations of the SM are in agreement with data. The "volume" 
of this yqcD domain is small, as a result CL(?7 = 0) is small. The frequentist i?fit scheme does 
not consider PDFs for yqcD parameters as a valid concept, it thus cannot define "volume" of 
a domain in this space deprived of metric: only the best realizations are retained to define the 
CL. 



An expanded view of the R&t CL(r/) function is shown on the left hand side of Fig. 12 in the 
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Figure 12: The R&t CL(f/) function (left hand side) and the Bayesian PDF of fj (right hand 
side) obtained using the same inputs. The solid (resp. dashed) line is obtained using Gaussian 
(resp. uniform) PDFs for the yqcD parameters. 



range [—1,+!]. Since no observable sensitive to CP violation is incorporated in the fit, CL(ry) 
is an even function. The Bayesian PDF of f) {c.f., Eq. (|56|)) is shown on the right hand side of 
the same figure. The sohd (resp. dashed) line is obtained using Gaussian (resp. uniform) PDFs 
for the uqcT) parameters. One observes that, independently of the definition used to derive a 
CL from the PDF, both Bayesian CLs will be low (at the percent level if one uses Eq. (|57|)) 
and most notably the one obtained from the uniform PDFs, although the inputs to the fit are 
identical to the ones used by -Rfit. 



7.2.2 Impact of the New sin2/3 Measurements 

The measurement of sin2/3 provides the UT angle /3 up to a four- fold ambiguity. To illustrate this, 



we have enlarged the borders of the {p, fj) plane in Fig. 13. Shown are the individual constraints 
and the result from the global fit corresponding to Fig. as well as the four solutions from the 
world average sin2/3wA • It is a non-trivial outcome of the SM fit that it leads to an exclusion of 
three out of the four ambiguities. 



The confidence levels for the sin2/3 measurements of BABAR and Belle |97] together with 



the world average^ and the result of Riit (without sin2/?) are shownp^ in Fig. 14. Given in 
addition are the results of the integrated PDFs obtained in the Bayesian analysis, when ascribing 
Gaussian or uniform PDFs to the systematic theoretical errors. While they significantly differ 
in the precision they claim for, the -Rfit and Bayesian indirect determinations of sin2/? are both 
compatible with the world average. 

The last two columns of Table ^ give the relative improvements (in percent) of the parameter 
constraints gained by including sin2/3wA in the CKM fit: the two quoted numbers ^32 and 5^ refer 
respectively to the ranges allowed at 32% and 5% CLs. All quantities sensitive to CP violation 

As stated before, the measurements are assumed to be Gaussian distributed. Therefore, the CLs given are 
direct confidence levels and not upper bounds, as one obtains when theoretical systematics contribute significantly 
to the uncertainty of a quantity. 

The plateau of the CL function obtained from i?fit corresponds to sin2/3 values belonging to the y^^^j domain. 
The CL on the plateau is not exactly equal to unity. The slight slope which is observed is due to the |Vcd| input: 
being a function of p and rj, and having a statistically dominated uncertainty, \Vcd\ lifts by a very slight amount 



the degeneracy discussed in Section 3.2 
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Figure 13: Confidence levels in the enlarged (p, fj) plane for the global CKM fit. See Fig. |^/or a 
description of the curves shown. The > 32% and > 5% CL constraints from the world average 
of the sin2/3 measurements, not entering the combined fit, are depicted by the dashed areas. All 
four ambiguities are drawn, three of which are excluded by the Standard Model. 

benefit from significantly smaller 32% allowed ranges, with a relative reduction of up to 50%. 
This reduction however gets suppressed when going to 5% CLs. This reduced improvement is 
explained by the fact that a significant fraction of 5% CL limits remain determined by the border 
of the y^oj domain, as can be seen on Fig. 11. Although the Gaussian shape of the additional 
constraint from sin2/3wA leads to significant structures within the allowed ranges of CLs, the 
[^QCd] ranges still determine part of the CL function tails. 



7.2.3 Numerical Comparison With Bayesian Results 

As discussed previously, the Bayesian treatment identifies experimental and theoretical likeli- 



hoods as probability density functions which are folded according to Eq. (pq) (c./., Section 4.4, 
Appendix^, and also Refs. ||6|, 0). In practice, the convolution integrals are solved within the 
CkmFitter package using Monte Carlo techniques generating some 10^ test samples. All input 
quantities fluctuate according to Gaussian distributions, using their statistical experimental un- 
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Figure 14: Confidence levels of the recent sin2/3 measurements of BABAR and Belle [9j_] 
together with the world average. These CLs are compared to the indirect SM constraints obtained 
from Rfit in this work (solid line). Also shown are the results from the integrated PDFs obtained 
in the Bayesian analysis using Gaussian and uniform PDFs for the systematic theoretical errors. 



certainties cr^xp, and to the PDFs ascribed to systematics, characterized by their systematical 
uncertainties dsyst- For the sake of simphcity, two choices are discussed here for the latter PDFs: 
all are taken to be Gaussians with standard deviations identified to Usyst, or all are taken to be 
uniform distributions with half-widths identified to Usyst- In the latter case, one does not identify 
the half-widths to -v/Sfsyst in order to use PDFs which (naively) would lead the Bayesian ap- 
proach to yield results the closest to the one of the -Rfit scheme. Because of that, in the uniform 
case the RMS is smaller than in the Gaussian case: one thus expects to claim for significantly 
smaller uncertainties for the former choice than for the latter choice. A comparison of the results 
for both choices is given in Table ^. The central values quoted correspond to the mean values of 
the resulting PDFs, while the errors are their RMS (i.e., when present, asymmetric errors have 
been averaged). The corresponding ranges provide a good approximation of the 68% confidence 
intervals which can be defined from an explicitly asymmetrical integration of the PDFs, since 
most of them closely resemble Gaussian distributions. As expected, the uncertainties are larger 
for the Gaussian choice. We observe a factor of about 2.2 (resp. 2.8) for the ratio between the 
> 32% CL intervals of i?fit (Table |2|) and the Bayesian ranges, for the Gaussian choice (resp. 
the uniform choice). For the Gaussian choice, this ratio reduces to about 1.3 for the > 5% CL. 

Figure |l5| provides a graphical comparison between the Riii result (the broad solid curve) and 
the Bayesian results (the dashed-dotted and dotted curves) on sin2/3. Shown in addition is the 
result obtained when asymmetrically integrating the output PDF obtained from the Bayesian 
analysis of Ref. |^] (the narrow solid curve) where mostly uniform PDFs were chosen for the 
dominant theoretical uncertainties. The plot visualizes the tendency observed in Table |^ for the 
tails of lower CLs from Rfit and the Bayesian approach to evolve towards comparable uncertainty 
ranges. However, the curves do not converge the ones to the others: the various treatments do 
not provide identical results, even for very low CLs. 
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Parameter 


Gaussian 


Uniform 


AFreq./ABayes(Gauss) 
> 32% CL > 5% CL 


A 


0.2219 ±0.0021 


0.2219 ±0.0021 


1.0 


1.0 


A 


0.832 ± 0.040 


0.830 ± 0.028 


1.5 


1.0 


P 


0.217 ±0.063 


0.203 ± 0.048 


2.2 


1.3 


V 


0.331 ± 0.056 


0.330 ± 0.039 


2.1 


1.3 


J 


(2.70 ±0.36) X 10-5 


(2.70 ±0.25) X 10-^ 


2.2 


1.3 


sin2a 


-0.32 ±0.30 


-0.30 ±0.24 


2.1 


1.2 


sin2/3 


0.710 ±0.093 


0.705 ± 0.065 


1.9 


1.1 


7 


57.0° ± 8.7° 


58.5° ± 7.0° 


2.2 


1.4 


Bk 


(230 ± 27) MeV 
0.91 ±0.12 


(227 ± 13) MeV 
0.89 ± 0.08 


1.4 


0.9 



Table 4: Results for the CKM parameters, the UT angles and theoretical parameters, using 
Bayesian statistics with Gaussian (second column) or uniform (third column) distributed prob- 
ability density functions for the systematic theoretical part of the input parameters. Note that 
asymmetric errors have been averaged. The constraint from sin2/3 is not used. The fourth and 
fifth columns give the ratios of the half widths of the > 32% and > 5% CL Jifit error intervals 
of Table ^ to the Bayesian errors given for the Gaussian case in the second column. 

7.3 Probing the Standard Model 

We have seen in the introduction that the metrological phase is intrinsicahy unable to detect 
a failure of the SM to describe the data. The interpretation of the test statistics Xmin;y^ad 
performed by means of a toy Monte Carlo simulation as described in Section The fits in 
the previous section yield for the point of best compatibility 

xLn;,„,o, = 2.3 (2.4) , (88) 

for the data set without (with) sin2/3wA- We now generate the distribution J^(x^) of Xmm;y^od 
fluctuating the measurements and parameters according to their non-theoretical errors around 
the theoretical values obtained using the parameter set y"^^^ for which is obtained Xmin;j/mod' 
The resulting toy distributions are shown by the solid (with sin2/3vvA in the fit) and dashed 
(no sin2/3) histograms in Fig. |l^. Integrating the distributions according to Eq. ( |50| ) yields the 
corresponding CL (smooth curves in FigH). We find 

V{SM) < CL(xLn;w) = 69% (71%) , (89) 

for the validity of the SM without (with) sin2/3vvA- Repeating the study using the sin2/3 measure- 
ment of BABAR (Belle) instead of the world average, gives confidence levels of ^^iXmm-y^od^ ~ 
59% (77%) for the vahdity of the SM. 

8 Supersymmetric Extensions of the Standard Model 

Having considered both metrology and probing the SM with the present data set, one is now led 
to attempt an example analysis within an extended theoretical framework. This section aims to 
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Figure 15: Comparison between Hfit (broad solid curve) and Bayesian fits for the indirect CKM 
constraint on sin2/?. The lower plot displays the identical curves as in the upper plot but in 
logarithmic scale. For the Bayesian fits: Gaussian (uniform) systematic theoretical PDFs are 
depicted as dashed-dotted (dotted) curves. Shown in addition is the (integrated) result obtained 
in the Bayesian analysis of Ref. ^ ( narrow solid curve ) for which mostly uniform PDFs were 
chosen for the dominant theoretical uncertainties. 



illustrate the search for specific new physics within a simple, predictive supersymmetric (SUSY) 
extension of the SM. 

There exist a considerable number of SUSY models in which new phases appear in the coupling 
between supersymmetric and SM fields. However, these models remain unpredictive as long as 
the additional phases are unconstrained. We therefore cannot forecast how the shape of the UT 
is affected by the new fields. As a starting point, one can use restrictive assumptions which 
lead to more predictive models. In particular, one may only retain models which do not involve 
additional CP violating phases, so that flavour-changing processes are described by the same 
quark flavor mixing matrix V as in the SM. Supersymmetric contributions to the transitions 
between the down-type quarks {b ^ s, b ^ d, s ^ d) are then proportional to the SM CKM 
matrix elements. This restriction defines the category of the so-called Minimal Flavour Violation 
(MFV) models which comprise some variants of the Minimal Supersymmetric Standard Model 
(MSSM), as well as the Two Higgs Doublet Models. 

The MSSM has been extensively studied in the literature, and next-to-leading order (NLO) 



corrections to the SM have been calculated |127, 128]. In this framework, the SUSY correction 



to neutral K and B meson mixing can be described by a single parameter which scales with the 



Inami-Lim function (36) of the top-quark loops in the box diagrams | 130 |: 

S{xt)^Sixt)il + f) , (90) 
leading to the following modified expressions 

Amrf(MSSM) = Amd{SM)[S{xt)^ S{xt){l + fd)] , (91) 
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Figure 16: Simulated J'{x'^) distributions and corresponding CLs not including (solid lines) 
and including sin2/?wA (dashed lines) in the fit. Indicated by the arrows are the corresponding 
minimal Xmin-v found in the analyses. 



Am,(MSSM) = Am,(SM) [S{xt) ^ S{xt){l + /«)] , (92) 
le^KMSSM) = \eK\iSM)[Sixt)^Sixt)il + f,)]. (93) 



As pointed out in Refs. [Il29| , |130|1 , the parameters fd, fs and belong to the same subprocesses 
so that the equahty 

f^fd = fs = fe, (94) 

holds in general. The numerical value for / is assumed to vary within the range 0.4 </ < 
0.75 [13C], while other authors find broader ranges —0.4 < / < 5.5 J129| , ^, |131| , depending 
on whether or not Supergravity constraints (i.e., additional relations between masses and other 
terms of the MSSM Lagrangian) are applied to the MSSM. 

The constraint from |e/<| in the p — f) plane follows the form of a hyperbola [129] 

^"(i-p-)4.)+Pc' 

where Pc = r]ctS{xc, xt) + iJccSixc) stems from the charm loop contribution for which the SUSY 
contribution is expected to be small. The neutral B mass difference Am^ measures the side Rt 
of the UT (HI), where: 



so that SUSY will reduce Rt in case of positive /. Using the above formulae, one readily derives 
the dependence of sin2/3 on the SUSY parameter / |12g|] 



2fj(l — n) 

sin2/3 = ^' oc (const - r^P^) • (97) 

The first, constant term dominates by a factor of two to three the second term, while SUSY 
modifies the second term via the parameter f\ only. Note also that if the charm contribution 
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Figure 17: Confidence level hounds of the global fit versus the MSSM offset f (solid line). The 
ledge at / ~ 1 starts at the upper frequency limit to which Arug amplitudes have been measured. It 
continues as a plateau due to the theoretically limited Am^ information and eventually decreases 
statistically. Also shown are the estimated constraints from additional measurements of Anig 
(CDF, DO) (dashed line), sin2/? (dotted line) andsm2a (dashed- dotted line) (bothBABAR, Belle) 
for the year 2005 !h 



to — mixing were negligeable, SUSY effects would be totally absent in sin2/?. It follows 
from this that 7 constitutes the most sensitive UT angle to the SUSY contribution /. 



8.1 Supersymmetric Fits 

The above SUSY parameterization has been included in CkmFitter and constrained fits are 
performed by setting a = {/} and to all other parameters Vmod- The resulting confidence level 



bounds for the global CKM fit is shown in Fig. 17 from which one obtains for the present data 
set the upper limit 

/ < 5.2 (95% CL) . (98) 

Also shown in Fig. |l^ are the improved constraints from future measurements assumed to yield 
Arus = (17.0 ± 0.9) ps'^ (CDF, DO), sin2^ = 0.77 ± 0.03 and sin2a = -0.32 ± 0.20 (BABAR, 



Belle) (see also the more detailed discussion about future precision measurements in Ref. |ll4| ). 
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9 Conclusions 



We present a new approach to a global fit of the CKM matrix. It is denoted i?fit and is based 
on frequentist statistics. We emphasize the thorough statistical definition of the method and 
discuss differences from Bayesian statistics ^ and from the 95% CL Scan method [^. The 
choice for the fit input parameters, their values and errors are discussed to some detail; in cases 
of doubts we favor the more conservative estimates. The CKM analysis is formally subdivided 
into three distinct phases: a metrological phase in which the Standard Model is assumed to be 
valid and confidence levels for the parameters are computed; a probing phase addressing the 
issue of the validity of the Standard Model description of data; a probing phase for new physics 
relying on predictive parameterizations. For the first phase of the analysis, graphical results are 
displayed in several one and two dimensional representations and numerical results are given for 
relevant CKM parameterizations, CKM matrix elements, Standard Model predictions of rare K 
and B meson decays, and selected theoretical parameters. For the parameters related to the CP 
violating phase of the CKM matrix, we find for the different parameterizations (the fit includes 
the present world average of sin2/3 measurements) 



J = 


(1.9 - 


3.5) X 10 


p = 


0.04 - 


0.37 , 


V = 


0.21 - 


0.42 , 


sin2a = 


-0.95 


- 0.33 , 


sin2/3 = 


0.47 - 


0.81 , 


a = 


80° - 


126° , 


P = 


14° - 


27° , 


7 = 5 = 


34° - 


82° , 



where the > 5% confidence level ranges are quoted. The second phase of the analysis provides 
an upper bound for the validity of the Standard Model, 

V{SM) < 71% . 

A simple predictive supersymmetric extension of the Standard Model has been studied in the 
third analysis phase. 
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A Critical Issues of the Bayesian Approach 



The Bayesian approach injects in the analysis pieces of information in the form of probabihty 
density distributions for the ygcD parameters. The -Rfit scheme proposed in the present paper 
advocates a non-Bayesian approach because most theoretical uncertainties on the ygcD param- 
eters do not stem from statistical fluctuations. The yqcD parameters are not random variables 
following probability density functions: there are poorly known, but fixed, parameters. The 
following examples serve the purpose to illustrate in a simplified framework the impact of using 
the Bayesian approach. 

Let Xi denote M vqcd parameters taking their values in identical allowed ranges [xi] = [—A, +A]. 
These Af yqcD parameters are assumed to combine to form , the theoretical prediction for 
an observable, as follows^ 

= n (99) 

1=1 

The theoretical prediction Tp^^ enters in the analysis as an unique yqcD parameter, which 

• within the -Rfit scheme, is characterized by its allowed range 

[T^-^^] = [-A^, +A^] , (100) 

• within the Bayesian approach, is characterized by its PDF 

p{T)= J ...J l[dxiG{x^)6{T-4^'^) , (101) 

where we assumed that identical PDFs, G{xi), were attributed to the Xi quantities. 

Independently of the details of the shape of G{x), if this PDF is non-zero at the origin, p{T) 
will exhibit a singularity at T = with leading term (— InT)*--^""^^. Hence, the larger M is, 
the more pronounced the peak at the origin of p{T) is. If one chooses a uniform distribution 
G{x) = 1/(2A), that is to say the PDF the "closest" to the Riit assumptions (albeit being a 
fundamentally different object), one obtains a striking result. For M = 1, the Bayesian approach 
and the RUt scheme are equivalent (though not for the associated CLs), insofar no other variable 
is involved: both state that the value of Tp^ is simply within the allowed range [— A,+A]. 
However, though originally no Xj values were favored, the Bayesian approach departs drastically 
from i?fit as soon as AA > 2: it states that Tp is most likely close to zero. In effect, when 
the number of ygcD parameters entering the computation of the theoretical prediction increases, 
and hence when our knowledge of the corresponding observable decreases, the Bayesian approach 
claims the converse. 

This is less a consequence of the initial ad hoc choice for the PDF G{x), than a consequence 
of the inescapable properties of Eq. ( [LOlD , when applied to a product of terms. For instance, 

This is not an academic exercize: products of yqcD terms are not rare in the theoretical predictions. For 
example, the Arud expression (c./., Eq. ( |70[ ) of Section 
the choice of [xi] ranges containing the origin is not met in practice. This choice is made here to highlight the 
difference between the _Rfit scheme and a PDF-based Bayesian approach. Ifowever, the singularity discussed below 
is present when Gaussian PDFs are used, as a matter of principle, whether or not [xi] ranges contains the origin. 



1.2) involves the product of tjb, f%. and Bd- However, 
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Figure 18: Convolution of the sum Tg = xi + X2 + ^3 (left hand plot) and the product Tp = 
X1X2X2, (right hand plot) of Af = 3 j/qcd parameters as defined in the text. Plotted is the PDF 
p{T) obtained from Bayesian statistics using for the G{x) PDF a uniform (solid lines, A = ^/3) 
or a Gaussian ( dashed lines, a = 1) distribution. Both PDFs p{T) of Tp present a singularity 
at the origin which is not shown. The Rfit ranges of T^^^ and Tp^ are indicated by the arrows 
located in both instances at ±3^/3. 



the peaking effect remains present for a sum of terms, but it is far less pronounced. The above 
generaUties are illustrated, for = 3 and A = \/3, in Fig. 18. The choice = 3 is made because 



(3) (3) 

the resulting allowed ranges for the product Tp = X1X2X3 and for the sum T^ = xi + X2 + xs 

(3) 

are identical when using Rfit, namely [— 3A,3A]. The figure shows the Bayesian PDF of Tg 
(left hand side) and Tp (right hand side) assuming for G{x) either a Gaussian distribution of 
standard deviation a = 1 (the solid lines) or a uniform distribution in the range [— \/3, +\/3] 
(the dashed lines) for the three parameters x = Xi, i = 1,2,3. The latter case is the closest to 
the R&t scheme for which the allowed range is [T] = [-3\/3,+3\/3], indicated by the arrows m 
Fig. |l^. For both, the sum Ts and the product Tp, the uniform and the Gaussian PDFs yield 
similar p(T) distributions, because their RMS are chosen identical. The still noticeable difference 



between the two G{x) PDFs is damped away by Eq. ( 101 ), even for such a moderate number 
like M = 3. As a consequence, when following the Bayesian approach, it is not a particularly 
conservative choice to adopt a uniform PDF instead of a Gaussian PDF. 

It is a remarkable difference between the two treatments that, whereas within the Rfit scheme 
the two theoretical predictions Tp and Ts are genuinely indistinguishable, they yield sharply 
different PDFs in the Bayesian approach. This shows that, even more than the ad hoc choice 
made to ascribe PDF to the yqcD parameters, it is the functional dependence of the theoretical 
predictions in these parameters, and the interplay between the various theoretical predictions 
within the complete CKM analysis, which play the central role. The attractiveness of the 
Bayesian approach, offering a straightforward procedure for analyses, is fallacious. The deep 
implication of ascribing PDFs to non-random variables is hidden inside an apparently innocuous 
convolution, the outcome of which reflects, more than anything else, the mathematical structure 
of the problem at hand. 
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B Critical Issues of the 95% CL Scan Method 



The 95% CL Scan method 4.2 presents several unwelcome features which are reviewed below. 



Drawing features: Its final graphical result depends strongly on the choices made for 
CLcutj the selection threshold used to retain models, and for CLcontj the constant defining the 
contours to be drawn. It is not possible to infer how a given drawing is modified by changing 
these choices. Regions of weak statistical confidence, just barely passing the selection threshold, 
and thus retained, bear the same graphical weight than regions of fair CL. Similarly, regions of 
lower CL, barely not passing the selection threshold, and thus not retained, are ignored. 



Contour features: The relevance of drawing the contours is not obvious. Whereas a given 
contour has a clear-cut meaning, the message it carries is to show how precise the determination 
of a would be, in the academic situation where all theoretical uncertainties would supposedly be 
resolved. These contours are not ellipses in general. For instance, in the case of the measurement 
of sin2/3 alone, they are built from straight lines (c./.. Section 3.2.3). They can take complicate 
shapes, depending on the choice made for the a variables, and depending on the presence of 
sizeable secondary minima. In simple situations, using the 95% CL Scan method yields awkward 
results. For instance, in the first example given in Section 3.2.3, where a standard situation is 
considered, the method would conclude by a set of short intervals in a, each obtained for a fixed 
2/QCD value instead of providing the overall allowed interval, i.e., applying the -Rfit scheme. 



Envelope features: In the a space no information can be carried about possibly more or 
less favored regions: a point is either within the envelope of the countours, and thus acceptable, 
or outside, and thus not acceptable. The envelope of the contours can be unstable with respect 
to change of CLcut- Indeed, the envelope can be a discontinuous function of this parameter: 
if lowering CLcut allows an outsider model to be selected outside of the envelope, this outsider 
model surfaces in the a space with its full contour and thus lead to an abrupt change of the 
envelope. The envelope provided by the 95% CL Scan method tends to be over-conservative. 
For the example of Section 3.2.3| , the method yields for the envelope of the 95% CL intervals: 



a± = a° ± 1.96 . ""^"^ ^ (c + Vl^) , (102) 
V 1 — ^ 



which is always larger than the correct result given by Eq. ([45D (the two limiting cases c ^ 
and c — > 1 excepted) . 



C Comments on Statistics of Normal Ratios 

Ratios of branching fractions of rare b ^ u transitions have attracted much attention in recent 
theoretical and experimental analyses, by virtue of their potential to constrain the unitarity 
angle 7 (see Section ^j^). It is shown in this section that the extraction of physical observables 
out of ratios of normally (Gaussian) distributed quantities, e.g., branching fractions, requires 
some precaution. 



C.l A Numerical Example 

The statistical discussion will be accompanied by a numerical example for rare charmless B 
decays. It is assumed that branching ratios are measured via the relation BR^ = Nz/{€z o'C), 
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where the reconstruction efficiencies of the two considered final states, z = x,y, shall be ex = 
Ey = 13%, the collected integrated luminosity >C = 20 fb^^, and the production cross section 
a = 1.1 nb. The "measured" branching fractions 

BR^ = (16.0 ± 2.4) X 10"^ , 

BRy = (8.0± 1.7) X 10"\ (103) 

correspond thus to 46 and 23 detected signal events in the channels x and y, respectively. The 
number of events is sufficient to escape from Poissonian to normal PDFs for the branching 
fractions^. To derive constraints from the above measurements one needs a predictive theory 
which, in our example, shall be given by the expressions 

BR.,thco(7) = |F(0)A,.(7)|' , 

BRj,,thco(7) = |F(0)Aj,(7)|2 , (104) 

with the "form factor" -F(O), being identical for both final states, and where the "amplitudes", 
which are functions of the "angle" 7, shall read 

Ay{-i) = l-e'^ . (105) 

Our theory depends on the external parameters -F(O) and 7. The latter provides an example 
of a quantity we are interested in (a = {7}), while the modulus-squared of the first is an 
example of a yqcD parameter. In the framework of -Rfit, |-F(0)p can be eliminated according to 
Section |3.2.2| , using Eq. ([39|), as illustrated in Section It can also be eliminated by taking 
the ratio i?theo(7) = BR2.,theo(7)/BRy,theo(7) = l^a;(7)P/l^y(7)P- Although it appears more 
straightforward to use the ratio than the iZfit treatment, it is shown below that the converse is 
true: using the ratio leads to cumbersome formulae. 

C.2 Probability Density Functions 

We define a set of two statistically independent measurements 

X = xq±cJx , (106) 

y = yo±ay , (107) 

obeying normal distributions 

G{z,zo,a,) = ^=^exp ( J'-'f ] , (108) 



with z = x,y. For the ratio 



y 

the marginal PDF obtained from error propagation 



R=- , (109) 



PiiR) G \ R, Ro, Ro\m + ^ \ , (110) 



y2 



■^^ As ratios of branching fractions are the subject of discussion here, the difference between normal and Poisso- 
nian statistics is much reduced in the resulting PDF, so that the results obtained in this section remain approxi- 
mately valid also for a low number of signal events. 
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where Rq = xq/i/q, is only a coarse approximation of the exact solution 



MR) = 

—oo — oo 

= G{0,Xo,crx)G{0,yo,cry) 



G{x,xo,ax)G{y,yo,ay)dxdy , 
2 



viR) 



l + y^eCR)e«(«) Erf (^^ecR)) 



Here, the functions ^ and rj are defined as 



r}{R) 



1 



J// 



-^B? H r- . 



Equations (110) and (111) satisfy 

Po,iiR)dR = 2-1 po,iiR)dR 



Ro 



1 . 



(Ill) 



(112) 



The densities po{R) and pi{R) are plotted in Fig. |19| for the example of the previous section. 
The maximum of po{R) is shifted from the naive expectation, Max/j{pi(i?)} = pi{Rq), to lower 
values, while its mean value is larger than the naive mean: (po) = 2.10 > (pi) = 2. The 
root mean square (RMS) of the correct solution exceeds the one of the naive approximation: 
RMS(/9o) = 0.62 > RMS(pi) = 0.51, so that the use of (|110|) will tend to over-optimistic results. 



C.3 Confidence Levels 

In our example, the analysis of the branching fractions, or of their ratio, aims at constraining 
the physical quantity 7. Figure ^ shows in its upper plot the theoretical ratio i?theo(7) versus 
7, together with the asymmetric, one standard deviation error band (using Eq. ( |116D ) of the 
"experimental" value. Without loss of generality, we may assume in the following that, apart 



from 7, the theoretical predictions (104) and ( |105| ) suffer only from the theoretical uncertainty 
on F(0), which shall be maximally unknown: i.e., we do not assume that |-F(0)p takes its values 
within an a priori finite range. An approach to obtain confidence levels for 7, that is often met 
in the literature, is to eliminate the factor F{0) by using the ratio (|109| ) and defining the as 



2 



X-f(7) = ( ^^^^^^) , (113) 



where aR is the RMS of pi (see Eq. (|110| ) ) . The associated CL is then given by the cumulative 
distribution of a normal PDF 

CLi(7) = Erfc(^y^X?(7)/2) , (114) 



shown versus 7 by the dotted line in the lower plot of Fig. 20 



Yet, we have seen that the PDF pi is only an approximation of the correct PDF po and hence, 
one is tempted to build a more accurate by means of 

X^„(7) = -21n/5o(i2theo(7)) , (115) 
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R 



Figure 19: Probability density functions po{R) (solid line), pi{R) (dashed line), and the like- 
lihood pnim{R) (dotted line), where R = x/y. The numerical values are those chosen for the 
example (see text). The distribution of pi{R) is symmetric by construction. The likelihood pmin 
is normalized in such a way that its maximal value (not its integral) is unity (see Eq. ( 12(\) ). 



where one may wish to build the corresponding CL, CLoa(7), via Eq. ( |114| ). However, this again 
constitutes an approximation since the error function assumes a normal PDF. The dotted curve 



in the lower plot of Fig. |20| shows the CL corresponding to Eqs. (|llq ) and ( |114D as a function 



of 7. Indeed, the correct CL is obtained via an asymmetric integration of the PDF pq: 

2 / po{R')dR' , Vi?(7) < Ro 

CLo(7) = <! (116) 
2 / po{R')dR' , Vi?(7) >i?o 

Rii) 



plotted versus 7 as solid line in the lower plot of Fig. 20 



The complication of Eq. (116) can be readily circumvented when not explicitly using the ratio, 



but keeping the original branching fractions in the definition of the x^'- 

x^(7, .qcd) = ( ""^^""J^-^^^'^ y + ( y-y<^--^\^y(^n ' . (117) 



Applying Eq. (|39|), hence eliminating yqcD, yields the minimum 

, (x-yW7))! ^^g. 

which, by construction, only depends on i?theo(7)- The CL obtained using Eq. ( 114 ) 



CL,nin(7) = Erfc (v/xLn;,QCD(7)/2) , (119) 
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Figure 20: Upper plot: "measured" ratio of branching fractions within its asymmetric, one stan- 
dard deviation error band (grey zone), corresponding to the example (105), and the "theoretical 
prediction" (IjlO^) ( solid curve) versus 7. Lower plot: confidence levels obtained from Eqs. \11S ) 
(dotted line), jjll^ ) (dashed line) (both using Eq. ( II4 )), and ^11 (\ ) (solid line), the latter being 
identical to Eq. ( 11^^ . 



is identical to the one obtained using Eq. ( |116D . The likehhood corresponding to Eq. 

Pmm(i?theo(7)) =e 2>^— ;.QCD^^^ , (120) 

is shown as the dotted curve in Fig. The hkehhood is equal to unity for i?thco(7) = x/y. It 
is worth emphasizing that Pmin(-R)) which yields the correct CL, when using Eq. dTTlD , is not 



identical to po{R), although the latter is the correct PDF, which yields the correct CL, when 
not using Eq. but Eq. ( |Tl^ ) instead. 
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